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PREFACE 


By the term "action at a distance,” we mean a relativistically invariant 
particle interaction, not the instantaneous action at a distance of Newton. 
It is customary in physics to use such interactions in some problems, for 
example, scattering, but the usual point of view regards interparticle action 
as only a part of a wider interaction of fields and particles. Fields are taken 
to possess further degrees of freedom that are not contained within the 
interparticle action. Such wider degrees of freedom are called on in other 
kinds of problems, to explain the decay of excited atomic states, for example. 

Our purpose in this book is to argue that all the results of physics, even 
those which are usually thought to arise from the independent degrees of 
freedom of fields, can be obtained from interparticle action—for decay 
problems and scattering problems alike. In developing this point of view, 
we have concentrated throughout on electrodynamics and gravitation, be¬ 
cause these parts of physics, being more complete than particle physics is 
at present, would seem to provide the most stringent test of our thesis. At 
the end, however, we consider how an apparently critical difference between 
long-range and short-range interactions can be resolved—an issue relevant 
to extending the theory to strong and weak interactions. 

The essential difference between field theory and what might be termed 
direct-interaction theory is that one can consider local systems in isolation 
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from the rest of the universe in the former, but not in the latter. The 
independent degrees of freedom of fields in field theory are logically equiv¬ 
alent to the interaction of local systems with the universe in the direct- 
interaction theory. Since the equivalence is found to operate in certain 
cosmological models but not in others, the equivalence is by no means just 
a mathematical transformation of one theory into the other. We learn 
important aspects of the cosmological structure of the universe from it. 
“Cosmology” appears in the title of the book for this reason. 

Although we have tried to present the argument in a self-contained way, 
not drawing extensively on outside results, the style presupposes for the most 
part that the reader is already familiar with “the usual theory,” by which 
we mean field theory. The book is based on a series of lectures given at 
the NATO summer school, Erice, Sicily, 1972. We have incorporated the 
lecture format into the structure of the book because we found it permitted 
us to divide the treatment into fairly individual packets. Some authors 
apparently think it a virtue to write a book like a single long mathematical 
proof, where one cannot read a part without reading the whole. We ourselves 
find such expositions discouraging. In our experience, the most useful books 
are those in which it is possible to jump fairly freely from one place to 
another. Ideally, a book should be multidimensional in its construction. We 
hope the lecture format has helped somewhat toward this goal. Particularly, 
we have had in mind the reader who may wish to jump over the details 
of quantum electrodynamics, and perhaps over the formal aspects of Chap¬ 
ter 5. Such a reader, especially if interested more in gravitation and inertia 
than in electrodynamics, may wish to proceed directly from Chapter 3 to 
Chapter 6. This is why Chapter 6 begins in a more subdued vein than one 
might expect from the style of Chapters 4 and 5. 

The authors wish to thank Rose Stratton and Evaline Gibbs for their help 
in preparing the manuscript. One of us (J. V. N.) is happy to acknowledge 
the award of a Jawaharlal Nehru fellowship. 


F. H. 


J. V. N. 
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PHYSICS AND COSMOLOGY 



INTRODUCTION 


In this book we shall be concerned with seeing how the structure of the 
universe may be relevant to physics. The current and conventional point 
of view is that, although the universe probably sets important boundary 
conditions for the operation of physical laws, the laws themselves are inde¬ 
pendent of large-scale structure and could in principle be determined by 
entirely local experiments. This book takes the opposite point of view, that 
the physical laws, as we usually state them, already involve the universe 
as a whole. 

In many problems it is possible to “decouple” the effect of the universe, 
in the sense that the influence of the universe remains effectively constant 
throughout the limited spacetime volumes that these problems concern. We 
shall, for example, regard particle masses as arising in the manner of Figure 
1.1, which shows the paths of two particles, a and b. A biscalar propagator, 
satisfying a certain explicit wave equation, G(A , B ), connects every pair of 
points A , B, on these paths. The mass of particle a , at the point A , is 
determined by a suitably defined summation of G(A,B) both with respect 
to variable B on the path of b and with respect to all particles b in the 
universe. It happens in many problems that the resulting mass, which can 
be denoted by m a (A), is almost completely determined by particles b that 
are very distant from a , and that m a (A) is effectively independent* of the 



G(A, B) 



FIGURE 1.1 

The mass of particle a at the 
point A arises from interactions 
with other particles b , the inter¬ 
action being carried by the bisca¬ 
lar propagator G(A, B), which is 
integrated over all points B on 
the world line of particle b. 


position of A within localized spacetime volumes. This independence permits 
us to think of m a (A) as a constant—say, m —in these problems. In such cases, 
we can consider particle a to obey the usual localized rules, whether classical 
or quantum, as in the Dirac equation 

dx l ’ 

where m is thought of as “belonging” to the particle. Such problems gain 
greatly in simplicity because the influence of the universe need not appear 
explicitly, provided we regard m as being empirically determined. This gain 
is at the expense of being able to determine m theoretically, however. 
Physics, working locally, seems incapable of explaining particle-mass values. 

If the reader will grant for the moment that such a point of view is correct, 
it will be clear that problems in which the universe exercises a constant 
environmental influence are likely to be more tractable than problems in 
which the environmental influence is not constant. Problems where the 
influence of the universe can be replaced by certain empirical choices, as 
with particle masses, are exactly the problems in which the most spectacular 
advances are likely to be made. Since it is the sensible practice of the 
physicist to concentrate heavily on those problems where progress can be 
made, a situation is only too likely to arise in which all solved problems 


(f + im )\p = 0; f = y 
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are of the decoupled kind. This situation in turn is only too likely to promote 
the point of view mentioned at the outset. It is easy to see that, by selecting 
for solvable problems, one could be led to the erroneous position of sup¬ 
posing that the physical laws were purely local—although many empirical 

choices would be needed to make such laws ‘'work,” just as many empirical 
choices are indeed made in modern physics. 

The decoupling of local systems from the universe is not always as con¬ 
ceptually simple as it is for particle masses. The manner in which decoupling 
must be made in order to obtain correct results may be well-known, as it 
is in thermodynamics, but a sense of mystery remains. Such parts of physics 
are regarded as containing something uncomfortably odd. 

In classical physics, and in quantum problems involving particles but not 
antiparticles, we are concerned with calculating how a state P, given at time 
t , changes into a state Q at time t + At, where At is a specified interval. 
The states P y Q , depend on the particle positions and momenta, and possibly 
on certain field quantities. (In quantum theory there are of course amplitudes 
for these variables, whereas in classical theory the variables take explicit 
values.) Provided the laws governing the change from P to Q are time- 
symmetric, one can obtain from Q a state R(Q) with the property that a 
system with state R at time t goes to P at time t + At. For example, in 
classical mechanics R(Q) is obtained from Q by reversing the signs of the 
momenta of the particle. Hence the class of all “final” states at time t + At 
contains all “initial” states at time t. No time-sense can be determined in 
such a system. There are no irreversible processes. 

Why then in the world of experience do we find the irreversibility of 
thermodynamics? A poor answer, but one that has often been given, argues 
that, because we cannot (for lack of technique) work through all the details 
that are involved in going from P to Q for many-particle systems of great 
complexity, some new mysterious factor has been overlooked. The argument 
is that, were it possible to make such explicit calculations, a miraculous 
principle, not apparent for simpler systems, would emerge. 

A second answer, one that has a closer relation to the world of experience, 
turns on restricting the class of initial states. Practical thermodynamics 
considers the state P to be that of a local system, not of the whole universe. 
The variables that define P therefore depend in part on the local system 
itself and in part on the manner in which the system is related to the outside 
world, and this dependence inevitably restricts the variables to the values 
which are actually possible in our particular universe. This restriction means 
that we do not have access to the whole class of all possible initial states 
but only to a subclass, say, %. This subclass of states at time t defines a 
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subclass, say, £, of states at time t + At. and these two subclasses need not 
be the same. Thus a P belonging to JC going to a Q belonging to £ may 
define an R(Q) that does not happen to be realizable in our universe, i.e. 
which does not belong to 9i\ We now have a time-sense, but one that owes 

its origin to the restrictions imposed by the universe. 

A third answer is to deny the time-symmetry of the basic laws. The slight 
difference in the decay rates of the K °, K °, mesons represents a small 
departure from time-symmetry. If this asymmetry is interpreted as basic to 
the physical laws, one might seek to relate all other asymmetries, such as 
that of thermodynamics, to the behavior of the K °, K °, mesons. We ourselves 
regard this as a possible, but not a promising, approach to the problem. 
We think it better to seek to explain the K°, K°, difference in terms of a 
particle-antiparticle asymmetry of the universe. 

So far as we are aware, no other suggestion for explaining thermodynamic 
irreversibility has been proposed. Of these three suggestions, only the second 
seems at first sight to provide firm ground. Yet there is a most important 
aspect of the third suggestion, connected not with mesons but with electro¬ 
magnetism, which must also be considered. Although the statement of the 
laws of electromagnetism is fully time-symmetric, all practical applications 
of the laws are asymmetric. The laws are made to yield asymmetric results 
by means of an ad hoc restriction to only time-retarded solutions of Max¬ 
well’s equations. This curious circumstance presents us with two alternatives. 
First, we can argue that the choice of time-retarded solutions is a basic 
principle of physics which we do not seek to explain. We then face two 
possibilities, which depend on whether we consider the empirically observed 
asymmetry of the universe to be derived from, or independent of, this new 
principle. The second possibility would require us to attribute thermo¬ 
dynamic irreversibility to a combination of two independent asymmetries, 
one belonging to the universe, the other to the time-retarded principle. The 
first possibility would require us to attribute all asymmetries, even those 
of the universe, to time-retardation. 

The second alternative is quite different in principle, and amounts to 
inverting this first possibility: we can attribute both thermodynamic irre¬ 
versibility and time-retardation to the structure of the universe—and this 

is the point of view we follow in this book. This alternative has the attraction 
that the need for imposing time-retardation as a basic principle is obviated. 
Surprisingly, we shall find that time-symmetric solutions of the electro- 
dynamic equations can yield all the observed effects, provided local problems 
are properly related to the universe and provided the universe has an 
appropriate large-scale structure. This last requirement shows that, although 
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the theory we will develop yields all the usual results, of both classical and 
quantum electrodynamics, it is not equivalent to the usual theory. The usual 

theory does not tell us anything about the large-scale structure of the 
universe. 

Much of what has just been said has been known, at least for classical 
electrodynamics, from work of Wheeler and Feynman done more than 
twenty years ago. The classical theory already shows how the unwanted 
advanced solutions of Maxwell's equations are canceled by what Wheeler 
and Feynman called the “response of the universe." Our particular task will 

be to show how those quantum phenomena which are usually taken to arise 
from the zero-point fluctuations of the quantized electromagnetic field can 
also be explained in a fully time-symmetric theory in terms of the response 
of the universe. Much of the first half of the book is concerned with this 
task. Readers who are more interested in sections dealing with the masses 
of particles and with general relativity may prefer to pass rather lightly over 
the quantum electrodynamics. Flowever, we consider the discussion of the 
latter to be crucial, since we would not like to elaborate the concepts of 
this book unless they were first shown to be capable of recovering the 
best-developed part of modern physics. 

So far, we have discussed two important aspects of the world, thermo¬ 
dynamic irreversibility and the need to choose retarded solutions of Max¬ 
well's equations, both of which fit uneasily into a purely local theory. Are 
there other practical issues which also fit uneasily, or not at all, into a local 
theory? 

Many scientists have remarked on a curious apparent coincidence involv¬ 
ing the physical constants and the Hubble constant H determined from the 
redshift-distance relation 



taken for galaxies whose distances d are not so large that the regions ol 
spacetime in question are markedly non-Euclidean. Here A\ is the wave¬ 
length shift for a spectrum line of laboratory wavelength A. The coincidence 
in question appears most clearly when the velocity of light c and Planck’s 
constant fi are set equal to unity, so that redundant units are suppressed. 
With c — fi — 1, spatial lengths and time intervals have the same unit, and 
particle masses become inverse lengths. The length unit is indeed best 
determined from the mass of some particle, from the t rr ± mesons, for example, 
as, say, All physical quantities then have dimensionalities which can 

be expressed in terms of a combination of m^ with certain dimensionless 



6 


1 INTRODUCTION 


numbers. The latter arise either from particle mass ratios or from particle 
interactions which serve to define four coupling constants, a v a 2 , a 3 , a 4 . 
For strong interactions, a x 15; a 2 — e 2 = ^ is the fine-structure constant 
equal to the square of the elementary charge; a 3 ~ 10 -5 for weak inter¬ 
actions; and 


a 4 = Gm* ~ 10" 40 

for the gravitational interaction, G being the Newtonian constant. Physical 
processes involve integral powers, not fractional powers, of these coupling 
constants; so the surprise of finding a physical constant as small as a 4 cannot 
be avoided simply by replacing a 4 by a small fractional power of a 4 . The 
“coincidence” can now be stated. It is 

H/m^ ~ a 4 , 

meaning that the ratio of the local length scale m^~ l to the cosmological 
length scale H~ l is also of order a 4 . Because the usual local formulation 
of physics throws no light on this “coincidence,” many modern cosmologists 
have eschewed an attempt to understand it. For our part, however, we 
believe the apparent coincidence of these numbers to be another indication 
that physics should not be formulated in a purely local way. 

As another example of an issue that fits uneasily into a local theory, we 
mention here the concept of an “origin” for the universe. The meaning of 
this concept will be a major topic in the second half of the book. 
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CLASSICAL THEORIES OF 

ELECTROMAGNETISM 
AND GRAVITATION 


§2.1. HISTORICAL DEVELOPMENTS (LECTURE 1) 

Theoretical physics, as we know it today, really began with Newton’s laws 
of motion and gravitation. The remarkable work of Copernicus, Kepler, 
Galileo, and others before them had provided a wealth of observational 
material on phenomena in the solar system and on the Earth. These seemed 
to show that nature follows some definite set of rules and patterns. Could 
these, in turn, be explained by simpler and fewer assumptions? An affirm¬ 
ative answer was given by Newton’s laws. Phenomena that had seemed 
diverse, like the pendulum, the falling apple, and the motion of planets, 
were shown to follow the same set of laws. 

A wrong law often explains all existing phenomena, but breaks down 
completely as new phenomena are uncovered. Newton’s laws, however, were 
able to surmount this hurdle successfully for two centuries, and consequently 
came to dominate the course of theoretical physics until the end of the 
nineteenth century. An example of this dominance is found in the origin 
of the electromagnetic theory, and it is interesting to follow the development 
of this theory alongside that of gravitation. 

The law of gravitation appears as an action-at-a-distance statement, in 
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which particles with masses m v m 2 , attract each other by a force 



G 


m 1 m 2 


9 



where r is their distance apart and G a constant. Coulomb’s law of attraction/ 
repulsion between two electric charges with magnitudes e v e 2 , or magnetic 
poles n v jit 2 , distance r apart, follows the same pattern: 


Foe 


g l g 2 

r 2 


Fee 


M1M2 



where + denotes attraction and — denotes repulsion. 

Repulsion as well as attraction is possible because charges and poles can 
exist in two species. In this respect (2) differs from (1): this difference is 
crucial, and led to very different developments of gravitation and electro¬ 
magnetism. Since all masses attract gravitationally, bigger masses tend to 
dominate smaller masses. Therefore one is necessarily led to study the 
interactions of large bodies, like planets, satellites, or stars. To detect influ¬ 
ences of gravitation on a laboratory scale is extremely difficult and requires 
very sensitive equipment. The situation is reversed for electromagnetism, 
since large objects like planets or stars are electromagnetically neutral, and 
since it is easy to produce isolated charges and magnetic dipoles in the 
laboratory. Thus (2) was studied very extensively in the laboratory. 

Although initially there was a good agreement between (2) and the 
laboratory experiments, more sophisticated experiments began to show that 
(2) does not represent the whole truth. In particular, phenomena associated 
with sufficiently rapidly moving charges are not explained by (2). It is true 
that (2) can be patched up to make it agree with experiment by adding 
further terms to the righthand side, but the resulting expression looks very 
inelegant. Moreover, the similarity with (1) which initially led to (2) is then 
lost. 

The solution of the electromagnetic problem came in an altogether differ¬ 
ent way, with the introduction of the so-called electromagnetic field. This 
entity is supposed to surround every electric charge and convey an influence 
from one charge to another. For example, when charge e x moves, it disturbs 
the field in its neighborhood. This disturbance propagates outward with finite 

e 2 . The disturbance is then commu¬ 
nicated to e 2 , which begins to move. The propagation of such disturbances 
in the field was described by waves. The action-at-a-distance picture came 


velocity and eventually reaches charge 
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therefore to be replaced by one of particles and fields. The mathematical 
structure for this was provided by Maxwell. 

The finite velocity of propagation in the electromagnetic field was identi¬ 
fied experimentally with the velocity of light. Because of their phenomenal 
success in describing electromagnetism, Maxwell’s equations inspired a 
revolution in physics. In 1905 Einstein proposed the special theory of 
relativity, which recognized c, the velocity of light, as having a special 
significance in being the maximum velocity attainable by any physical entity. 
This recognition led to a revision of the ideas of spacetime and motion that 
had been held sacrosanct by the physicist since Newton. 

Special relativity explained why the law given by (2) failed to describe 
all electromagnetism. The interaction, according to (2), is instantaneous; i.e., 
(2) assumes that the interaction propagates from one particle to another 
with infinite speed. This assumption is refuted by the rule that c is the 
maximum velocity possible in our universe. 

This conclusion naturally led people to wonder whether (1) was also 
wrong! Once we accept special relativity, we must object to (1) on the same 
grounds that apply to (2). Indeed, if we were able to experiment with rapidly 
moving objects in the laboratory in order to test (1), the discrepancy would 
long ago have been noticed. The massive astronomical objects available for 
study do not move fast enough to reveal any such discrepancy. So objections 
to (1) must be based on theoretical grounds. 

It is possible to modify (1) in several different ways to make it consistent 
with relativity. Einstein himself proposed a radical revision of the law of 
gravitation in 1915, which is known as the general theory of relativity. We 
therefore have the following interrelations between theories of gravitation 
and of electromagnetism: 


Newton’s laws of gravitation and motion 

\ 

Coulomb’s laws of electromagnetism 

\ 

Maxwell’s equations (electromagnetism) 

\ 

Special relativity (motion) 

\ 

General relativity (gravitation) 


In this chapter we shall examine Maxwell’s equations and general relativity 
to see how successful they have been in this century. 
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§2.2. MAXWELL-LORENTZ ELECTRODYNAMICS: 

THEORETICAL FRAMEWORK 


Maxwell's equations can be written most elegantly in a four-dimensional 
format. We define a vector field A i (i = 1,2, 3, 4) to be the 4-potential, and 
derive the electromagnetic field F ik by skew-differentiation: 



Here x\ i = 1, 2, 3, 4, are the spacetime coordinates, with x 1 , x 2 , x 3 , 
representing Cartesian space coordinates, and x 4 — ct the time coordinate. 
We shall assume c — 1 in all formal discussion, and use c explicitly only 
where we are concerned with experiment or observation. Special relativity 
defines an invariant line element 


ds 2 = r\ ik dx l dx k , (4) 

where n] ik = diagonal ( — 1, — 1, — 1, + 1); r) ik is used for raising or lowering 
indices. 

Maxwell's equations consist of two sets. The first one is the identity implicit 
in (3): 



where a semicolon denotes differentiation with respect to coordinates. The 
second set is given by 


F ik ; k = - 477 J\ ( 6 ) 

where J l is the 4-current vector. The equations (5) and (6) permit a gauge 
transformation 


where <p is a scalar. We can choose (f> such that 




Henceforth we shall assume that such a transformation has been made and 
use (8) without the tilde; i.e., 


A 1 .; 



0 . 


(9) 
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It is then easy to see that 



UA % = A\ 


i k 


k 


47rJ l y 


( 10 ) 


which shows that the electromagnetic disturbances in vacuo propagate as 
waves with velocity c(= 1). 

To make connection with the older experimental data, we relate the above 
entities to the pre-relativity quantities—the electric and magnetic 3-vectors 

£ and H, the current density j, and the charge density p—by means of the 
following identifications: 

(A 1 , A 2 , A 3 ) = A (the vector potential), 

A 4 = </> (the scalar potential), 


(7 1 ,/ 2 , J 3 ) =j, 


(^23’ ^31’ ^12) 


J 4 


P> 


H, 


C ^ 41 > £421 £ 43 ) — £• 


( 11 ) 


We then get the familiar Maxwell equations in a medium of unit dielectric 
constant and permeability: 


V • E = 47rp, 
V X H — 477 j -f 


V • = 0, 


dE 

dt ’ 


V XE 


a h 

dt 


( 12 ) 


The gauge condition becomes 


d<p 

V A + — = 0. 

dt 


(13) 


A and c p satisfy the wave equations 


d 2 A 

'dt 2 


V 2 A — 477 /, 


d 2 cj> 
' dt 2 


F 2 <f> = 477 p. 


(14) 


So far we have equations describing how fields are related to the distri¬ 
bution of charges and currents. To complete the picture we need equations 
of motion of charges in electromagnetic fields. These are supplied by the 
so-called Lorentz force equations. In the 4-dimensional format, these are 



( 15 ) 
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where m 0 is the rest mass and e the charge of a particle moving in the field 
F ik with 4-velocity u\ If v is the corresponding Newtonian 3-velocity and 

m the mass, we have 



and in three-dimensional notation, (15) becomes 



-y- (mv) = e[E 4 - v X H]. (17) 

at 

The expression on the righthand side is known as the Lorentz force. 

The Maxwell-Lorentz picture, described above in its simplest form, has 
worked remarkably well in classical physics. More details can be introduced 
to describe properties of various materials, but the basic picture stands. The 
theory had to be modified when the quantum nature of the universe became 
apparent; yet quantum electrodynamics based on the Maxwellian picture 
was still able to explain many of the detailed properties of matter. 

Nevertheless, the theory has certain unsatisfactory features which tend 
to suggest that it is far from perfect. The objections range from purely formal 
ones to computational ones at high energies. We shall review some of these 
in the following sections. 


Retarded and Advanced Solutions 

The electromagnetic problem, according to the Maxwell-Lorentz theory, 
consists of two parts. One, as shown by (10) or (14), is to compute the 
electromagnetic fields due to a given charge-current distribution. The other 
part is to compute the motion of charged particles in given electromagnetic 
fields according to equations (15) or (17). We shall consider the first part 
in this section and the second part in the following section. 

To solve (10) we make use of Green’s functions. Consider a solution 
G(X , X x ) of the scalar wave equation 

DG(X, X t ) = 8 4 (X - XJ, (18) 

where S 4 (2f — X x ) is the four-dimensional Dirac delta function and the □ 
operator is with respect to coordinates of point X. The details of such 
solutions are discussed extensively in the literature. For the present we shall 
assume that a solution exists. 
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Consider now a four-dimensional domain f surrounding a point X v and 
bounded by a 3-surface 2. Using Green’s theorem. 



A i k G)n k d. 



A i;k G\- k dK 


where the diflerentiations are with respect to X , which is a variable point. 
Thus we have 



KCO G(X, X x ). k - A^X).. G(X, X,))n k dX 



\A { DG - UA,G\dV 


Ai(X x ) - f 4nJ i (X)G(X, X x )dV. 


Hence 


MXi) 


— 4v f G(X,X 1 )J i (X)dV + f [A l (X)G(X,X 1 ). k 


- At(X) ;t G(X, XJJn* dZ. (19) 

We are therefore able to express the value of A x at an interior point in terms 
of the source J i in V and the values of A { and its normal derivative on the 
boundary 2 of V. This result is known as Kirchhoff’s theorem. 

So far we have not said anything about the form of G. We now choose 
an explicit solution of (18) given by 



where (x, t) and (x v t x ) are the coordinates of X and X v respectively. It can 
be verified that (20) indeed satisfies (18). 

From (19) we now see that the contribution to A i at (jc 1 , t x ) comes from 
only those points which satisfy 



i.e., from points on the past light cone of X v The values of J t or A i in the 
future do not appear to make any contribution at all. In particular, if we 
choose V as the slab of spacetime 




(t 2 C C)’ 


(22) 
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the contribution to comes only from the past light cone of X\ bounded 
by t = t 2 (see Figure 2.1). If further we let t 2 — oo, and assume that 
the contribution of the second integral in (19) tends to zero, we are lett with 
just the volume integral. Then (19) can be written in the form 




The solution obtained in (23) is known as the retarded solution, for the 
obvious reason that the potentials at (x v / x ) are determined by the charge- 
current distributions at earlier times. If we had retained the second integral 
in (19) over t — t 2 , but not over t — / 3 , we would have additional contri¬ 
butions from the values of A and cj> specified at an earlier time t 2 . This 
addition of the second integral also does not violate the retarded nature 
of the solution. 

The retarded solution appeals to our intuition because it conforms to the 
usual idea of causality. If we imagine the electromagnetic disturbances as 
generated by charge movements, then we expect these disturbances to arise 



Advanced and retarded Green’s functions. Retarded contributions to (r,, /,) come from 
the past hypersurface t = t 2 . Advanced contributions come from the future hypersurface 
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subsequent to the motion of the charges. This is expressed by the retarded 
solutions (23). It would seem, at first sight, that we have achieved the 
remarkable result of explaining causality through Maxwell’s equations. 

Such a conclusion is erroneous, however, as can be seen from the fact 
that we obtained (23) with the use of (20) for G, and this is not a unique 

procedure. We could have chosen other forms of G, with different results. 
For example, if we had taken 


G 


1 


4tt\x 


^iy _ g + \ x — *ii]» 


(24) 


which is also a solution of (18), we would have arrived at 


A 


(*i. h) = f 


j(x, t 1 + 


x — x 


X — X 


'—d^x. 


r p( x > G + 
G) - J - - — 


*iD 


X — X 


1 


d 3 x, 


(25) 


instead of (23), together with surface integrals, which might vanish. In (25) 
the disturbances in the electromagnetic fields appear to arise prior to the 
motion of the charges! This solution is called the advanced solution. 

Clearly causality is violated by (25). But since (25) also satisfies Maxwell’s 
equations, we cannot deduce causality from Maxwell’s equations. Indeed, 
these equations are invariant under time reversal, 


(26) 

and so for any retarded solution there must also exist an advanced solution, 
and vice versa. 

We may ask which of the two solutions, (23) or (25), is the correct one. 
To answer this question, we go back to (19), which retains the surface 
integral. Provided we thus retain the surface integral, either way of describ¬ 
ing A i is correct. When we apply (19) to the slab t 2 < t < t 3 and let 
t 2 ——oo or t 3 —oo, the same conclusion holds, provided we retain the 
surface integrals. However, when we make the assumption that one or 
both of the surface integrals over t — t 2 ,t — t 3 , vanishes, we are introducing 
a new element which is not contained in the Maxwell equations. It is this 
element, which is a boundary condition, that makes the solution take the 
form (23) or (25). In order to be consistent with causality, we choose the 
boundary condition leading to (23) rather than (25 ). For Maxwell’s equations 
themselves, the choice is arbitrary. The equations supply us with both 
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advanced and retarded solutions (and, because of their linearity, with any 
linear combination of them). Of these, we select the one in agreement with 
experience. 

With so many solutions theoretically possible, why does nature always 
select the retarded one? That this question cannot be answered within the 
framework of Maxwell’s theory must be regarded as one of its intrinsic 
weaknesses. 


The Paradox of Self-Action 

We now turn to the influence of fields on charged particles, as expressed 
by (17). Although (17) poses no problems so long as the fields E , H , are 
due to other sources, a curious situation arises when we include in E and 
H the fields produced by the charge itself. Theoretically, these fields must 
also act on the charge. If, following the arguments of the last section, we 
take these fields as retarded ones, they carry energy from the charge. By 
conservation of energy, we expect the charge to lose energy, and the “self” 
force to damp the motion of the charge. 

Using Maxwell’s equations, we can evaluate the retarded fields produced 
by the moving charge. To obtain the self force, we need this field taken 
at the charge itself. Now, for a point charge the field is infinite, but the 
self force is finite, because it arises from the averaging of the self field in 
all directions. In the averaging process, many infinite terms cancel, and we 
are left with a force term which is finite. The limiting process is an involved 
one; see Dirac (1938a) for details. We shall here try to reproduce some of 
Dirac’s ideas in a simple, nonrelativistic way. 

Consider an electric charge e moving in a straight line in an external field 
E in the x-direction. The equation of motion is 


mx = 


eE -b 




where the last term is the self-force term obtained after the limiting operation 
described above. It is easy to see that if the charge is oscillating, the self-force 
damps the motion. For the rate of work done by the self-force. 



The time-average of the first term on the righthand side of (28) vanishes 
in an oscillatory motion, whereas the second term is negative definite. Thus 
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the charge loses its kinetic energy at an average rate of 


9 •• 9 

— e~ x ~ . 
3 




If we calculate the electromagnetic waves generated by the charge, we find 
that they carry energy across a sphere at infinity at the same average rate 
(29). This verifies the conservation of energy mentioned above. At first sight, 
then, the self-force appears to act correctly. However, a paradox begins to 
appear when we apply (27) to the case E = 0. We then have 



This has the trivial solution x = constant. It also has an exponential solution, 

x = (constant) exp(//A). (31) 

Although x = constant causes no problem, the exponential solution leads 
to unbounded velocities as t oo. Nor can the solution (31) be avoided. 
For example, if in practice a charge passes from a region of non-zero 
electromagnetic field to a region of zero field, it will not in general be true 
that all higher derivatives x , x,..., are zero on entry to the zero field. Hence 
the solution (31) is generated. 

To avoid unbounded solutions, Dirac suggested the following procedure. 
Consider a simple example where the charge is unaffected by any external 
force except for an impulse at t = 0. The equation of motion for this case 
can be written in the form 


x = P <5(0 + \x 9 (32) 

where P is the strength of the impulse. Now, a conventional solution of (32) 
would be: 


x = 0, for t < 0; 

x = P(\ - e t/x ), for / > 0. (33) 

As mentioned before, this solution is unbounded. Instead of (33), Dirac 
proposed the following solution: 

x = P exp(//A), for t < 0; 

x = P 9 for t > 0. (34) 
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Notice that, because of the presence of the x term in (32), x rather than 
x is discontinuous at t = 0 in both solutions. The second solution is bounded 
for all times. However, this boundedness has been achieved at a cost. In 
(34), although the impulse is applied at t = 0, the charge begins to move 
earlier, and builds up a velocity and acceleration that the impulse just cancels 
at t — 0. This anticipatory movement of the charge violates causality. Dirac 
argued that this breakdown of causality may be due to the internal structure 
of the electron. However, to a physicist accustomed to the usual ideas of 

causality, the solution (34) will be unpalatable. 

Yet the conventional solution (33) also has conceptual as well as mathe¬ 
matical difficulties. One may ask, “Where does the charge get its kinetic 
energy if its velocity becomes larger and larger according to (33)?” The 
answer lies in the self-field of the particle. The energy of this field is infinite 
for a point charge, and the particle is able to draw on its source indefinitely. 
This is no more palatable than the acausality of (34). 

It seems, therefore, that infinite self-interaction cannot be avoided in 
classical Maxwellian electrodynamics. With the introduction of quantization, 
it was hoped to circumvent these difficulties. Although it does turn out that 
quantum theory reduces the order of this infinity, it cannot altogether 
remove it. Furthermore, quantum theory introduces another infinity of 
non-classical origin, the vacuum polarization. Elaborate renormalization 
procedures are required to extract meaningful finite results from apparently 
divergent calculations. 

These considerations suggest that all is not well with Maxwellian electro¬ 
dynamics, in either its classical or its quantum form. On the other hand, 
its impressive experimental successes suggest that the theory does represent 
some considerable measure of truth and should not be lightly abandoned. 
In Chapter 3 we shall investigate another form of electrodynamics, which 
resembles Maxwell's in its experimental details, but which is better able to 
meet the type of criticism we have raised. 


§2.3. GENERAL RELATIVITY: THEORETICAL 

FRAMEWORK (LECTURE 2) 

Einstein approached the subject of gravitation in a remarkable way. He 
argued that gravitation is intimately connected with the geometric properties 
of spacetime. To follow this argument, it is helpful to compare and contrast 
the properties of gravitation with those of the electromagnetic field. From 
the broad similarity of Newton's and Coulomb’s laws, we might expect 
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gravitation to be a field in the Maxwellian sense. However, the electro¬ 
magnetic field is capable of being switched on or off. The absence of any 
observed “anti-gravity" in nature suggests that the same is not true of 
gravitation. Thus the gravitational field in a given region of spacetime is 
more permanent than the electromagnetic field. 

Einstein expressed this result by relating gravitation to the spacetime 
structure itself, that is, by postulating that the presence of gravitation means 
the spacetime geometry is non-Euclidean. In the special theory of relativity, 
the invariant line element 


ds 2 = r] i)c dx l dx k 

plays a fundamental part. General relativity requires (4) to be modified to 
take account of gravitation. In general we write 

ds 2 = g ik dx i dx k , (35) 

where x 1 (i = 1, 2, 3, 4) are not necessarily Cartesian coordinates, although 
(35) retains a critical property of (4): the signature of the quadratic form. 
Although g ik can now be real functions of x\ the form (35), when diag¬ 
onalized at any point, must have as a signature either (— — — + ) or 
(+ + + —). We shall use (— — — +) throughout. 

In special relativity, (4) is invariant under Lorentz transformations. In 
general relativity, (35) is invariant under general coordinate transformations 

x l — >-x' i (x k ), where x fi are twice-differentiable functions of x k . 

The effect of coordinate transformations is to cause vectors and tensors 

to undergo transformations also. A contravariant vector A 1 transforms 
according to 



and a covariant vector B k as 





Tensors are defined by appropriately extending these laws. 

The geometric properties of spacetime are those of a Riemannian mani¬ 
fold. Thus we define parallel displacement of vectors A 1 , B k , through an 
infinitesimal coordinate change 8x l by 

8A l = - 


r\i A k Sx l , 


SB k = r\ t B { Sx l , 


(38) 
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where 



the matrix with elements g vm being reciprocal to the matrix || g im 
The covariant derivatives of A 1 and B k are given by 



a A i 

dx l 





dx l 





The “non-Euclidean” nature of the geometry is apparent when we displace 
A 1 or B k parallel to itself along a closed curve. It fails to return to its starting 
value! Quantitatively this failure is expressed by the noncommutativity of 
repeated covariant differentiations: 



and a similar relation for B k . The tensor R 1 ^ is the Riemann-Christoffel 
tensor. It is given by 



From R l mlk we can construct two lower-rank tensors: 

R — R i R — R 

f'mk mki ’ ^ <5 -*'■ mk • 



R mk is called the Ricci tensor and R the scalar curvature. The combination 



has zero divergence: 


G mk ;k = 0. (45) 

G mk is the Einstein tensor. 

These relations characterize the geometric properties of spacetime. Physi¬ 
cal ideas are introduced through Einstein’s equations of gravitation: 

Gik — ~ K Ti k , (46) 

where T ik is the energy-momentum tensor of matter and of all physical 
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interactions, and k is a coupling constant chosen to be 

k = 877 ( 7 , ( 47 ) 

where G is the Newtonian constant. The choice (47) is dictated by the 
requirement that the gravitational equations reduce to the Newtonian form 
in the limit of weak fields, i.e., when g ik differs from j] ik by quantities of 
the first order of smallness. The details of this analysis are found in standard 
textbooks on relativity, and we shall not repeat them here. We point out 
in passing that, because of (45), T tk . k = 0; i.e., the law of conservation of 
energy follows from Einstein’s equations. 


Gravitational Collapse: Black Holes 

Gravitation possesses an exceedingly curious property, which can be illus¬ 
trated by comparing the gravitational attraction between two masses with 
the attraction between two particles connected by a stretched elastic string. 
In the latter case, the particles approach each other until the string attains 
its natural length; the attraction ceases thereafter. In the former case, 
however, the particles approach each other with ever-increasing attraction— 
until they bump into each other. The difference in the two cases is reflected 
in the potential energy stored in the two systems. In the elastic-string case, 
the potential energy is given by 

V = X(r — a ) 2 , (48) 

where A is a constant, r is the separation of the particles and a is the natural 
length of the string. In the gravitational case, the potential energy, obtained 
from Newton’s law (1), is 



Gm 1 m 2 

r 



The expression (48) is positive definite and is characteristic of most systems 
in the world. But (49), on the other hand, is negative, and goes to —00 as 
r 0. In both cases the system seeks to attain the position of minimum 
potential energy. Whereas with (48) it is able to do so in a smooth, steady 
way, with (49) the system undergoes catastrophic collapse. Of the various 
known interactions in physics, gravitation appears to be unique in exhibiting 
this property. The fact that (49) is negative is often described by the state¬ 
ment that the energy stored in a gravitational field is negative. 
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This discussion is based on Newton’s law of motion, and one wonders 
whether general relativity is different in this respect. As it turns out, general 
relativity leads to an even more catastrophic situation, which we shall discuss 
in some detail, since it has occupied the attention of physicists for the last 
decade. 

We discuss first a simple problem using Newtonian laws. Consider a 
uniform spherical object of mass M , density p 0 and radius R 0 which begins 
contraction under its own gravitation. We assume no internal pressures. Then 
a particle on the surface of the object, of radius R at time /, has as equation 
of motion 


• • 



GM 

R 2 ’ 



with R = R 0 , R = 0 at time t = 0. It is easy to see that (50) gives 







so the object shrinks to a point in a time r, 




2 GM(R tl - 






where 



Thus a uniform object with the mean density of the Sun, ^2 gm cm -3 , would 
shrink to a point in about 25 minutes if all its internal pressure were 
withdrawn. 

This Newtonian calculation is very closely simulated by general relativity. 
It is found that the interior of the object can be described by the line element 




dr^_ 

— ar 2 


+ r 2 (sin 2 0 d<j)' 2 + d0 2 ) 


r < R 0 > ( 54 ) 


where r, 0 , <f>, are spherical polar coordinates that are taken constant for 
a freely falling particle, and t is the proper time of the particle. The exterior 
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solution, r > R 0 , is given by 



c2 / < R o 3/2 \ f K 2 dr 2 

\ r 3/2 /LI- ( aR 3 /r) 


+ r 2 (d0 2 + sin 2 # d<f> 2 ) 




<«> 

in the same “comoving coordinates.” The function S(t) in the interior rep¬ 
resents the fraction by which the object has contracted at any given time t. 
If we define R(t ) = R 0 S(t), then R(t) satisfies the same equation (51). 

Thus, according to general relativity, the object collapses into R = 0 in 
the same time as that given by (52). However, the situation here is more 
drastic, for not only does the body fall into a point, but spacetime itself 
becomes singular at t = m/Q.\[a)\ 

At first one might imagine the following way out of this situation. Consider 
an observer exterior to the object, and not falling into it. Such an observer 
can work in terms of the exterior Schwarzschild line element, 

ds 2 — dT 2 {\ — )- d&__ _ p2(d02 + sin20 d<t>2) ' (56) 

P 

where 

p = rS, T — T(r , t). ( 57 ) 

If such an observer uses light signals to maintain contact with an observer 
who is falling freely with the object, the following situation develops. So 
long as the freely falling observer satisfies the condition 

rS(t) >2GM, (58) 

any signals transmitted by him will eventually be received by the external 
observer. Contact is broken, however, when (58) is violated, as it eventually 
must be. As this stage is approached, the signals take longer and longer 
in T-time to pass from one observer to the other, and the violation of (58) 
is never discovered by the external observer, no matter how long he lives. 
When (58) is violated, the object is supposed to have become a “black hole”; 
i.e., it is so highly collapsed that its strong gravitational field prevents any 
escape of light. 

Thus no spacetime singularity at r — 0 is ever visible to the external 
observer. All he can ever see is an object collapsing toward a black hole. 
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This apparent resolution of the singularity problem is not satisfactory, 
however, since it excludes the experience of the freely falling observer. A 
more satisfactory resolution would require spacetime to be nonsingular 
everywhere. 

The collapse of the object can be halted in Newtonian gravitation by 
putting sufficiently strong pressures in the object. The same prescription does 
not work in general relativity, however. Any agency causing pressure has 
energy, and energy attracts, according to relativity. Thus the self-gravitation 
of the object is increased by the very agency that is meant to stop it. These 
general concepts were quantitatively expressed first by Penrose (1965), who 
showed that under certain general conditions no ‘‘reasonable” physical 
agency is capable of halting gravitational collapse into a singularity. A 
“reasonable” agency is one which has positive energy in the sense discussed 
earlier in this section. 

Is the singularity of gravitational collapse an academic one? Recent 
astronomical discoveries of quasars and pulsars suggest the contrary. Astron¬ 
omers are beginning to observe massive, highly condensed objects in which 
self-gravitation is likely to be the predominant force. The continued stable 
existence, and sometimes outward expansion, of these objects suggests that 
the ideas of general relativity might require modification for strong gravita¬ 
tional fields. 


Cosmological Models: Singularity 

The singularity described above also appears in the cosmological solutions 
of Einstein’s equations. The simplest of the cosmological models are the 
homogeneous and isotropic ones described by the Robertson-Walker line 
element. 




-b r 2 {d0 2 + sin 2 # d(fi 2 ) 



Here / is the cosmic time and r, #, </>, the spherical polar coordinates of a 
typical galaxy—or a fundamental observer. The parameter A: is 0 or zb 1. 
The spaces t = constant are homogeneous and isotropic. They are flat if 
k = 0, have positive curvature and finite volume if k = +1, and negative 
curvature it k = —1. S(t) is a scale factor. For a dust model, the energy- 
momentum tensor is given by 








§2.4. ACTION PRINCIPLE FOR ELEC; ROMAGNETISM AND GRAVITATION 


25 


where p — p(t), and u l = (0,0,0, 1) corresponds to the motion of a funda¬ 
mental observer. The Einstein equations then lead to 


and 


pS 3 = constant = A (say), 






The similarity between (54) and (59), and between (51) and (62), is 
obvious, and indeed the solution for a finite object (51) is identical with 
that for the universe if k = 1. The only difference is that for a finite object 
R(t) was decreasing with t , whereas for an expanding universe S(t) increases 
with t. The singularity at S = 0 is the same as that discussed earlier. The 

cases k — 0, — 1, also have the singular state S = 0. This state is described 
in cosmological studies as the "big bang.” 

Work by Hawking and Penrose (1969) has shown that the singularity 
S' — 0 is inevitable in all cosmological models under certain quite general 
conditions. As with a finite object, one of these conditions is the positive 
definiteness of the energy of all physical systems contained in T ik . This 
condition, expressed by 


T ik v i v k > 0, (63) 

where v l is any timelike vector, is satisfied by all interactions in physics 
known so far. We are therefore led to a catastrophic situation for the universe 
similar to that for a finite collapsing object. 

The cases discussed here and in the previous section suggest that some¬ 
thing may well be wrong with a straightforward extrapolation of our knowl¬ 
edge from weak gravitational fields to strong ones. This possibility is recog¬ 
nized by many theoretical physicists, although they differ in their approaches 
to solving the problem. 


§2.4. ACTION PRINCIPLE FOR ELECTROMAGNETISM 

AND GRAVITATION (LECTURE 3) 

The Maxwell and Einstein equations discussed so far took their present form 
only after many experiments and much trial and error. It turns out, however, 
that they can be derived in a simple and elegant way from the principle 
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of stationary action. Since we shall be concerned with this principle in the 
new approach to these subjects, it is worth repeating the derivation of the 
Maxwell and Einstein equations from an action principle. 

The action is given by 

s = iho S d * x ~ 2f m o da 


- F ik Fik \/-i d * x - 2 f e « A i dc,i • (64) 

a 

The various quantities appearing in (64) have all been defined before. In 
particular, e a9 ra a , refer to the charge and mass of a typical particle denoted 
by a. The variable parameters in (64) are A i and g ik and the world lines 
of the particles. We obtain the required physical equations by setting 8S = 0 
for small arbitrary variations of these parameters. 

It is easy to verify that variation of A i leads to Maxwell’s equations in 
curved space: 


where 








8 4 (X,A) da 1 

V - g(X) da 




is the current vector. 1 

Variation with respect to geometry g ik is more cumbersome, but eventually 
leads to 



-8 vG(T'\ m) 



where T lk (m) is the matter tensor, 


T ik ax) 



8 4 (X,A) da 1 da k 

—... r- da , 

y—g(X) da da 


and T lk ( em) is the electromagnetic tensor, 




^he gauge being chosen so that A k . k = 0. 
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m 


gi 
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Finally, variation of the world line of a leads to the equation of motion 
of a: 


d 2 a x da k da 1 

da 2 + kl da da 




For an uncharged particle we get the geodesic equation. 

The three variations are actually interdependent, as may be seen by taking 
the divergence of (67) in the neighborhood of a particle a. By taking appro¬ 
priate limits, and using (65), (66), we arrive at (70), because the variation 
of a world line can be described in terms of variation of g ik and of A t . This 
point is considered in some detail in Chapter 8. 


Conformal Invariance 

We end this chapter with a brief discussion of conformal transformations, 

which will play an important part in later work. 

Consider a spacetime manifold M in which the metric is given by g ik . 
If we now change g ik to a new metric g* ik , given by, say, 




2 ? 


gik 



we have a new spacetime M *. When £2 is a finite nonzero function of the 
coordinates, (71) is called a conformal transformation. It is easy to see that 
any element of length ds in M transforms to ds* in M , where 

ds* = fids = ds. (72) 


Thus the ratio of two such lengths in different directions, but at the same 
point, remains unchanged under a conformal transformation. 

A quantity is said to be conformally invariant if it does not change under 
(71). One example is the electromagnetic action. It is easy to see that if we 

put 



then the electromagnetic part of (64) is equal to a similar expression in terms 
of a% and F* ik with respect to M*. Variation of A* i in this invariant part 
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of the action leads to Maxwell’s equations with respect 2 to F* ik and to M*. 
These equations are said to be invariant under conformal transformations. 
We shall consider later whether conformal invariance is a property that 
should be satisfied by other parts of physics. 

The space M is said to be conformally flat if we can find a coordinate 
system in which 


g ik — Vik’ 

gik “ @ 2 Vik- (74) 

Later we shall show that the Robertson-Walker spaces given by (59) are 
conformally flat. 

Einstein’s equations are not conformally invariant. It can be verified that 
under the conformal transformation (71) R imlk , R ik , and R transform as 

(75) 

(76) 

(77) 

where 


R*< 


imlk 


e2 ^[Rimlk + gik$ ml T gml^ik gil^ 


mk 


gink ^i 


il 


T (gikgml gilgmk ) ^1 £]> 


R\ 


ik 


Rik + ik + + 2 A^), 


R 


e~ 2 !(R + 6D? + 6 AJ) = tt~ 2 (R + bi?" 1 ^), 



= SJ*- 



From (75), (76), (77), we can construct another tensor which is invariant 
under (71). This is the Weyl conformal curvature tensor: 


C h 

^ ijk 


1 


R h ijk T ^ ig h jRik g h kRij + gikR k j gijR 


1 


+ jR(g h kg ij -g h j gik)l (79) 


■^But not with respect to the wave equation for A* t . If one wishes to obtain ri/4 *. — 477 /*^ 

it is necessary to add a gauge transformation in order that A* k . k = 0, i.e., to define A* { by 

4*i = T + 4> ; i> where the scalar function 4> satisfies an equation of the form given in (26) 
of Chapter 5. 


THE ABSORBER THEORY OF 
WHEELER AND FEYNMAN 


§3.1. THE FOKKER ACTION (LECTURE 4) 

In 1845, in a letter to Weber, Gauss wrote about action at a distance in 
electrodynamics: “I would doubtless have published my researches long ago 
were it not that, at the time I gave them up, I had failed to find what I 
regarded as the keystone, Nil actum reputans si quid superesset agendum: 
namely, the derivation of the additional forces—to be added to the inter¬ 
action of electrical charges at rest, when they are both in motion—from an 
action which is propagated not instantaneously but in time, as is the case 
with light.” 

Gauss’s problem was solved in the early decades of the present century 
by several physicists, notably Schwarzschild, Tetrode, and Fokker. Essen¬ 
tially the problem was to make action at a distance relativistically invariant. 
We give here the formulation devised by Fokker, in which the action is 

e a e b 8(s 2 AB )da i db k i] ik , (1) 

a a <b 


where 
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a\ b\ being the 4-coordinates of typical points A and B on the world lines 
of particles a , b , respectively. We are here regarding the particles as being 


ordered numerically, to give meaning to a < b. The notation implies that 
in the double sum the pair a , b , is counted only once. 


Thus the charges interact if and only if A and B can be connected by 
a light ray. In the language of special relativity, “action at a distance” is 


an unfortunate description. We really have action at zero 4-dimensional 
distance, s 2 AB = 0. 

It is instructive to compare S with the action for the Maxwellian theory 
given in Chapter 2. The second term of (1) includes all the electromagnetic 
effects of the Fokker theory, and should be compared with both the pure 
field term and the interaction term of Maxwell’s theory. To show that (1) 
in fact describes the same electrodynamic properties of matter as Maxwell’s 
fields do, we define the following quantities: 



(3) 



(4) 



( 5 ) 


A t (b) and F ik b) may be called the potential and field arising from particle 
b , whereas J^ b) is the 4-current density represented by the motion of the 
charge b. Because of Dirac’s identity. 



( 6 ) 


we get 


UA i {b \X) = 4vJt b \X). 


(7) 


Also, if the charge e is neither created nor destroyed, i.e., if its world line 
stretches from t — — oo to t = Too, we get 



( 8 ) 


It follows from (7), (8), and (4) that 


F ibHk . k = -47Tj (bH 


(9) 
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The similarity between (7), (8), (9), and the Maxwell equations now 
becomes more apparent. It should, however, be remembered that, unlike 
the Maxwell equations, (7), (8), and (9) are identities because of the defini¬ 
tions (3), (4), (5). The fields F {b) ik or the potentials A ib \ are defined in terms 
of the particle world line; they have no degrees of freedom of their own. 
To distinguish them from ordinary fields, we shall call them direct-particle 
fields whenever there is likely to be any confusion. 

The variation of the world line of a leads to the equation of motion of 
charge a: 



This again is similar to the Lorentz force equation, but with the difference 
that the field of the particle a itself is specifically excluded from the righthand 
side. There is no “self action.” 

This completes the formal description of the theory of direct interparticle 
action according to Fokker. Although superficially there is similarity with 
Maxwell’s theory, this theory is different in one crucial practical respect. 
As the action functional (1) indicates, there is complete time-symmetry. This 
is shown more explicitly if we work out the potential (3). It has the form 





A* 


i(adv) 




where A i(ret) (b) is the retarded solution of Maxwell’s equations for the same 
charge, and A i(adv) (b) is the advanced solution. The light cone from a general 
point X intersects the world line of b in two points. The motion of b at the 
point on the past part of the cone defines the retarded solution, whereas 
the motion at the point on the future part defines the advanced solution 

(see Figure 3.1). 

As discussed in Chapter 2, our experience is consistent with retarded 
solutions, and therefore in disagreement with the above formulation. For 
this reason, although the theory so far developed surmounted the difficulty 
of the lack of Lorentz invariance of the Coulomb theory, it still seemed a 
long way from being a successful rival to Maxwell’s theory. It was not until 
1945 that the next important development in the theory took place. 
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FIGURE 3.1 

Advanced and retarded solutions. 


§3.2. THE RESPONSE OF THE UNIVERSE 

(LECTURE 5) 

The presence of advanced solutions side by side with the retarded ones can 
lead to paradoxical situations. Let us describe one as a typical example. 

Suppose two observers, A and B , situated one light-hour apart, can 
communicate with each other via advanced as well as retarded light signals. 
They agree to set up a signaling system in which A sends a signal to B at 

4 p.m. if and only if he does not hear from B at 4 p.m., and B sends a signal 
to A at 5 p.m. if and only if he does hear from A at that time. Will this 
arrangement work? In a world of retarded signals only, it can work. But 
a little reasoning tells us that it is unworkable in the present situation. For, 
if A does send a signal at 4 p.m., it reaches B at 3 p.m. and 5 p.m. So B sends 
a signal at 5 p.m. which reaches A at 4 p.m. and 6 p.m. But if A does get 
a signal at 4 p.m., he should not send one out at 4 p.m. The other alternative 
is also impossible. 

That a theory with advanced and retarded signals can be made to work 
in the real world was first realized by Wheeler and Feynman. They pointed 
out that in the real world one has to take into account a “response of the 
universe.” How the universe enters the argument in a crucial way can be 
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seen as follows. Suppose we move an electric charge a at time t — 0. Its 
retarded effect is felt by another charge b at distance r at time t — r. The 
advanced reaction of b on a reaches the charge a at time t — r — r — 0! 
Thus the reaction is instantaneous, no matter how large r may be. To find 
out the electromagnetic effects of moving charge a , we must therefore take 
into account the reactions of all other charges b in the universe. The com¬ 
bined reaction of all these charges is termed the response of the universe. 
Wheeler and Feynman gave the following simple calculation of the response 
in a static Euclidean universe with a uniform distribution of charges of 
different species. 

Suppose we move the charge a , and describe its general motion by a(t). 
The velocity and acceleration of a are d(t) and a(t), respectively. Since the 
calculation is linear throughout, we can proceed by Fourier analysis. Let 
a typical Fourier component of the acceleration be given by 


( 12 ) 



To simplify the picture, Wheeler and Feynman imagined the charge a 
located at the center of a spherical cavity of radius R. The cavity is supposed 
to be empty except for charge a. The medium outside the cavity is uniform, 
with N charges e k per unit volume. 

In vacuum, the full retarded electric field of a at large distance r (see 
Figure 3.2) would be 


e 


(13) 


sin 0 exp[/to(r — /)] 


r 


in the direction of increasing 0 , where e is the charge of <2, and 0 is the 
angle between the radius vector and the acceleration. Modifying (13) to take 


(r, 0, <f>) 



The direction u () of the acceleration of the 
source particle gives the axis ot the spheri 
cal polar coordinates r, 0, </>. The retarded 
electric field in vacuum due to the source 
particle has magnitude & and is in the di¬ 
rection of increasing 6. 


FIGURE 3.2 
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account of the medium, first we have a factor 

2(1 + n — iky 1 

arising from reflection at r — R , where n — ik is the refractive index of the 
medium. We also have an additional phase factor 

exp[/'co(fl — ik — l)(r — R)] (15) 

due to transmission in the medium. The electric field in the present problem 
is therefore 


& 


2 eu 0 sin 0 
r( 1 + n — ik) 


exp(/co[r — t + (n — ik — l)(r — /?)]} 


(16) 


in the direction of increasing 6. This field acts on a charge b to produce 
an acceleration 

—/>(«)£ (17) 

m 


also in the direction of increasing 0. Here m is the mass, e k is the charge 
of /?, and p{u) is a frequency-dependent factor which includes damping 
effects. The function /?(co) is related to the refractive index by 

4 77 NP ~ 

(n - ik) 2 = 1- *-p( co). (18) 

The motion of charge b produces a half-advanced field at a, which 
constitutes the reaction of b on a . The component of this field parallel to 
i# 0 has the magnitude 


k 


m 


P( w) 



sin 0 ) exp( — iur)&. 


(19) 


The net response C :R of the universe is obtained by summing (19) over all 
charges b y a. 




=/ 


oo 



7T 'I'TT 


r-R~e = 0^0 = 0 



e k 


p(oj) sin 0 e ™ r & • Nr' 2 sin 0 dr dO d(j>. (20) 


Straightforward integration, using (16) and (18), leads to 


(R 


2 . 
3 


ico eu 0 e 


—iu)t 


( 21 ) 
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A similar calculation shows that the component of the response normal to 
w o averages to zero, so that (21) can be written in the form 


cn 


ico 

3 


u 0 e 


-iut 


( 22 ) 


We now go back to the Fourier sum of (12). It is clear from (22) that 
the response for the sum will be given by 


2e ... 

a. 


3 


(23) 


and the charge experiences a force of reaction given by 


2<? 2 ... 

~Y a ' 


(24) 


If we had calculated the response field not at r — 0, but in the neigh¬ 
borhood of r — 0 in the cavity, we would have found that it amounts to 



where F {a) ret is a typical retarded-field component of a. This calculation, 
which is slightly more involved than above, was also given by Wheeler and 

Feynman. They argued that (25) modifies the time-symmetric field of a , 
namely 


y ^ (a) ret + y ^ (a) adv> (26) 

to produce the full retarded field observed in practice. 

The above calculation is a self-consistent one. We start with the assump¬ 
tion that the net electromagnetic disturbance (13) from the particle is the 
full retarded one. We then show how this disturbs other charges and we 
calculate their response. The response must come out to be that given by 
(25) for the calculation to be self-consistent. 

It is worth emphasizing how the two parts of this calculation differ in 
the use of the refractive index. In the first part, we anticipate that the net 
field in the neighborhood of each particle will be the retarded one. As this 
travels outward, it disturbs other charges whose field, as a result of this 
disturbance, must be added to the original field. This many-particle process 
is described by the use of the refractive index. In the second part of the 
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calculation, we calculate the advanced reaction produced by each charge 
b on a as determined by the action terms 

- e a e b J f S(s~ AB ) da i db\ (27) 

and this two-particle interaction has no refractive index involved. 


The Absorber Theory of Radiation 

The remarkable fact which emerges from this calculation is that the final 
result ( 22 ) does not involve details of the rest of the universe, e.g., parameters 
like vV or n — ik. These quantities appear in the intermediate steps, but are 
neatly canceled out in the end. This cancelation is not a coincidence, but 
involves an important property of the universe: the property of perfect 
absorption, which means that when the charge a is excited, all electromag¬ 
netic disturbances arising from it—directly or through the reaction of other 
charges—should tend to zero sufficiently rapidly at great distances from a. 
Mathematically, we say that 

S J l^ret + ^advl ~ * ( 7 ) *S ' + »• (28) 

In a vacuum a radiative field drops off as r -1 . What (28) indicates is a faster 
rate of attenuation than r~ x , to include genuine absorption. A universe which 
satisfies (28) is said to be a perfect absorber. Wheeler and Feynman demon¬ 
strated that charges influence one another by means of only retarded inter¬ 
actions in such a universe. Their proof is along the following lines. 

The condition (28) refers to a combination of outgoing and incoming 
waves, and for it to hold at all times, we need to have the two types of 
waves tend to zero separately: 

2 J F (b> ret ~ 0 ( i). s2F ,w adv ~o(i), r+oo. (29) 

The conditions (29) also imply that 

2 J l F<b> ret - F(M aJ " « ( 7 ) as r + 00 . (30) 
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However, the lefthand side of (30) is a field without sources and satisfies 

the homogeneous wave equation. The condition (30) at infinity therefore 
implies that it is identically zero everywhere: 



- T (w adv ] = 0. 


The field acting on charge a can thus be written in the form 



2 
b ^ a 





= 2 F<W ret + J [F ,a \ ei 

b -Jl a 




The first term on the righthand side represents the retarded effects of all 
charges b ^ a. The second term is identical to that obtained by direct 
calculation above. This completes the proof. 

The second term on the righthand side of (32) is interesting in that it 
brings out the difierence in approach to radiation of the Maxwell field theory 
and of the direct particle theory. The form of this term was first suggested 
by Dirac, who pointed out that its limiting value at the charge a is equal 
to the so-called radiative damping force. In field theory this term has no 
further significance, and Dirac's result remains somewhat mysterious. We 
have already noted that the radiative damping force is a self force in the 
Maxwell field theory. But in the above Wheeler-Feynman approach this 
term arises as a response of the universe. A charge experiences this force 
because of the complete absorption of its outgoing field. While absorbing 
a fraction of this disturbance, each absorber particle sends a reaction. The 
total reaction from the whole universe obtained by summing all absorber 
particles then has the above form. 

This latter point of view has the advantage of preventing the runaway 
solutions discussed in Chapter 2, for if the charge gets on to such a solution, 
it will have an unbounded field which cannot be absorbed; so the formula 
leading to this solution will break down. The runaway solution is therefore 
not self-consistent within the framework of the theory. 

The derivation of the general result (32) leaves one point somewhat 
unclear. At what stage have we introduced time-asymmetry into the argu¬ 
ment? Clearly the condition (31) is also time-symmetric, but (32) is appar¬ 
ently not. Suppose we rewrite (32) in equivalent form, 



^retL (33) 
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with the ‘'advanced” and “retarded” terms interchanged. There is nothing 
to prevent us from using (33) instead of (32). To do so within a calculation 
of the kind which led to (22) would be exceedingly awkward, however. In 
deriving (32) we assumed the absorber particle to be at rest before it was 
hit by the retarded source field. This is correct in the case of (32) because 
the first term on the righthand side is uncorrelated with the motion of a , 
but in (33) the first term is highly correlated. If we take these correlations 
into account, we discover that (33) is the same as (32). 

But could we not make a similar self-consistent calculation in which the 
absorber particle came to rest after the advanced source field hit it? Within 
the Wheeler-Feynman theory such a time-reversed calculation can certainly 
be made. It is then the first term on the righthand side of (33) that is 
uncorrelated with the motion of a , and that on the righthand side of (32) 
is now highly correlated. This situation represents an inversion of the 
electromagnetic arrow of time. Thus there are self-consistent calculations 
leading to either sense of causality. Wheeler and Feynman sought to avoid 
this ambiguity by an appeal to thermodynamics. It was argued that for an 
absorber particle to come to rest after being hit by the source field would 
require highly “artificial” initial conditions—conditions with negligible 
statistical probability. Thus according to Wheeler and Feynman the time- 
asymmetry in electrodynamics is connected with the problem of initial 
conditions and thermodynamics. 


§3.3. COSMOLOGICAL CONSIDERATIONS 

(LECTURE 6) 


The original calculation of Wheeler and Feynman was for a static Euclidean 
universe. Such a universe is time-symmetric—unless one introduces asym¬ 
metric initial conditions, as mentioned above. In 1962 Hogarth criticized 
the Wheeler-Feynman approach on the grounds that it did not properly 
include the time-asymmetry of the large-scale structure of the universe. In 
an expanding universe, the future and the past have in general different 
physical properties. Over and above this, the advanced and retarded electro¬ 
magnetic waves undergo changes during propagation. The former are 
blueshifted, the latter redshifted. Thus in an ever-expanding universe, the 
absorption of retarded waves is an interaction of low-frequency radiation 
with matter, but the absorption ol advanced waves is an interaction at high 
energy. Thus the two cycles of argument, one based on retarded electro- 
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magnetic waves, the other on advanced waves, which take the same form 
in a static universe, become different in an expanding universe. Hogarth 
analyzed these problems in certain simple cosmological models, and was 
able to classify these models according to the nature of wave propagation. 
Hoyle and Narlikar (1963) and Roe (1969) later looked at the same problem 
Irom a different point of view. Their conclusions are essentially the same 
as those of Hogarth. We shall follow the later treatment here. 

Before applying cosmology, we need to mention a formal problem con¬ 
cerning the definition of Fokker action. So far it has been defined in flat 
spacetime. In order to apply it to the Riemannian spacetime used in cos¬ 
mology, we must write the action in a generally covariant form. Hogarth 
sought to avoid this step by using the conformal invariance of Maxwell’s 
equations. Since Robertson-Walker spaces are conformally flat, he was able 
to use flat-space solutions for electromagnetic fields straight away in such 

spaces. More strictly, however, one should demonstrate (a) that the Fokker 
action can be written in a Riemannian space, and (b) that the action so 
written is conformally invariant. We shall look at these and other related 
formal problems in Chapter 5. For now, we can assume that the required 
results can be proved. 

We begin by obtaining a general result analogous to that proved by 
Wheeler and Feynman in flat space. We show that in a cosmological model, 
for the retarded field to be consistent, it must be suitably attenuated at 
infinity—i.e., the future part of the universe must act as a perfect absorber. 
For the advanced field to be consistent, the past part of the universe must 
be a perfect absorber. We give the proof for retarded waves. That for 
advanced waves is similar. 

Suppose we move charge a and consider the fields arising from such a 
motion. They are determined by 

J l^ret + ^’advl + yS l^ret + ^advl- <34, 

b ^ a 

We also suppose the motion given to the charge to be bounded in space 
and time. Suppose the retarded fields of all particles in (34) are suitably 
attenuated at large distances from a. This implies that the advanced part 
of (34) also tends to zero suitably rapidly, because the only effective con¬ 
tribution to S b F {b) adv comes from a relatively local set of particles which 
have been excited by the retarded field of a. (See Figure 3.3.) Since the 
acceleration of a is bounded, the set of such particles tends to zero as we 
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FIGURE 3.3 

The retarded field due to the bounded motion of the source particle is considered to be 
damped by perfect absorption outside a 4-cylinder. A sufficiently large distance r from 
the source particle, advanced fields must come from absorber particles b that lie outside 
this cylinder. Hence the advanced field F (b) adv is effectively zero. 

evaluate U b F {b) adv at large distance from a. We therefore have 

X [F' w ret - F< M adv ] (35) 

b 

tending to zero faster than a radiation field in vacuum. Again, since (35) 
satisfies a homogeneous wave equation, it must be zero everywhere. Then 
(34) becomes 

F ( “>ret + £ F«\ ev (36) 

b 7^ a 

i.e., a full retarded field. 

We notice that in this derivation we give an asymmetric status to the 
retarded and advanced fields. The retarded fields are to be self-consistent, 
and hence basic. The advanced fields are secondary, arising from particles 
b ^ a which have been excited by the retarded fields. These particles lie 
in the future part of the universe. We emphasize again that it does not follow 
that we can reverse the sign of the time coordinate, arguing that if retarded 
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fields turn out to be self-consistent then so will advanced fields. To check 

the consistency ot advanced solutions, a similar calculation for the past light 

cone of a has to be made, and this is not a simple reversal of the calculation 
for the future light cone of a. 

We next examine details ot such calculations in some well-known cosmo¬ 
logical models. 


Cosmological Models: Explicit Examples 

We have remarked that the Robertson-Walker spaces are conformally flat. 

The transformation to the conformally flat form is particularly simple if 

k = 0. We shall consider models with k = 0 first, and in some detail. The 
line element 


ds 2 = dt 2 — Q 2 (t)[dr 2 + r 2 {d6 2 + sin 2 0 d(j> 2 )] (37) 


transforms to 


ds 2 = e 2 $[dr 2 — dr 2 — r 2 (d0 2 + sin 2 # d§ 2 )\, (38) 

with the substitutions 


T = / exp «r) = Q{t). (39) 

The scale of the function Q(t) can be chosen for convenience, for example, 
by taking Q — 1 at the present instant. The constant of integration in (39) 
can also be chosen for convenience. 

As we did for flat space, we Fourier-analyze the acceleration u(t) of the 
charge a , and consider a single component with frequency co 0 . Write 

u = u 0 (40) 

and choose spherical polar coordinates #, </>, such that 6 = 0 corresponds 
to the direction of i/ 0 . The electric field and the magnetic field several 
wavelengths away from the charge are in the directions of increasing 6 and 
<#>, respectively. Denoting them by E e and /T, in flat space the retarded fields 
would be given in vacuum by 



eu 


o 


sin 0 exp[ — ico 0 (t 



r 


(41) 
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In a conformally flat space, (41) is modified in that t is replaced by r: 



eu 


o 


sin 6 exp[ — /co 0 (t 




We have to modify (42) further to include the effect of other charges, i.e., 
the refractive index. First we notice that, although the frequency of the wave 
is to 0 with respect to the r coordinate, proper time is given by the t coordinate. 
Using (39) we therefore have the proper frequency 



= c0 °"^’ ~ w ° ex p( - ^’ 



which shows that the retarded waves are redshifted, since exp f increases 
in the future for an expanding universe. 

To calculate the change of phase produced by propagation in the cosmo¬ 
logical medium, we note that a coordinate distance dr corresponds to a 
proper distance 

dz = dr ex p f. (44) 

If n — ik is the refractive index of the medium, the extra change of phase 
produced by propagation through dr is by a factor 

exp i(n — ik — l )oo dz = exp i(n — ik — l)co 0 dr. (45) 

The total phase change due to propagation from a to distance r is therefore 
represented by the factor 



The integral sign appears since n — ik changes with r because the proper 
frequency of the wave depends on r, and perhaps because the density of 
particles in the medium changes. Thus the field at distance r is given by 




e u 0 sin 0 

- : -exp[ - /co 0 (t - r)] • 77. 



Here 17 represents the absorptive properties of the medium. For it to be a 
perfect absorber, we require tj —0 with increasing r. That is, we need 
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as r approaches the “infinite boundary” of the universe accessible to retarded 
waves. This need not correspond to r oo, as we shall see shortly. 

For any cosmological model, we can calculate the integral on the lefthand 
side of (48). If the result turns out to tend to — oo as r increases to the 
“boundary” of the universe, then a self-consistent system of electrodynamics 
involving retarded fields is possible for such a model. If the result does not 
satisfy (48), then a self-consistent retarded electrodynamics is not possible 
for the model in question. 

To calculate this integral, we need to know how k depends on to. This 
dependence is determined by the physical process of absorption. We shall 
consider two such processes: radiative reaction, and collisional damping. 
Hogarth in his paper considered only the second process, and his analysis 
was criticized for using a process which is of thermodynamic origin. If we 
are to consider the problem within the framework of electrodynamics and 
cosmology only, we should not bring in other sources of time-asymmetry. 
To avoid this, we consider first the damping produced by radiative reaction. 

Radiative Reaction. When the absorber particle is set in motion, it acceler¬ 
ates. In a universe with self-consistent retarded solutions, this acceleration 
produces a radiative reaction. For particle b, this is characterized by the 

field 



If, on the other hand, we were looking for self-consistent advanced solutions, 
the sign of (49) would be reversed. The sign of k is thus unambiguously 

determined. 

Suppose an electric field of magnitude & and frequency to is incident on 
an absorber particle of mass m and charge e. The motion in the direction 
of the field is determined by the equation 


mx = 





so that the oscillatory displacement is 



lie 2 
3 m 


to + O(o) 2 ) 


—i<jjt 



Taking N such charges per unit volume, the medium has a polarization Nex 
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in the direction of S. Therefore the refractive index is given by n — ik, where 

(n — ik) 2 S e~ loit = & e~ toit 4 - 4t iNex\ 



1 - (n - 



AirNe 2 

moo 2 


2 ie 2 n( 2 \ 

—— co -|- 0 {co ) 

3 m 



We have written (51) and (52) in a form suitable for low frequencies, since, 
as we pointed out earlier, the absorption in the future takes place at low 
frequencies. To solve for k from (52), we must specify the dependence of 
N on co. In a Friedmann universe, we have 

TVoc-L tocc-L, i.e., TVocu 3 . 

Q 3 Q 

Then we get from (52) 

k — — Xoo~ 2 , X = constant > 0. 

For the special case of the Einstein-de Sitter model, we have 

/3 Ht\ 2/S 2 

Q{t) = Ff) ’ t=z JH = ‘° ’ (55) 


(53) 


(54) 


where H is the Hubble constant at the present epoch t 0 . In conformally flat 
coordinates, putting r [defined by (39)] at zero when / = 0, we have 

exp f = (y Ht ) 2 ’ t ° = ^- (56) 


A light ray emitted at the present r 0 will reach a point with coordinate r 
at time r = r 0 + r. Hence the frequency co is given by 


W —^(Tq -f V ) 


CO 


0 


2 


//(t 0 + r) 


-2 



The integral for absorption is 




( 58 ) 
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Since (58) converges, the absorption is incomplete, and the retarded solution 
is not self-consistent in this model. 

We look next at the steady-state cosmological model, which is charac¬ 
terized by 

N = constant, Q(t) = exp Ht , t — 0 for present epoch, 

expf(T) = ( —//r) _1 , r — _ -L f or present epoch. (59) 

In this model the asymptotic form for k is 

k ~~ —/xco _l , fx = constant > 0. (60) 

Arguing in the same way as above shows that at coordinate r the retarded 
wave has the frequency 


co = co 0 (l - Hr ), 

and the absorption integral diverges: 





jll 

- Hr) 





Thus the steady-state model has a perfect future absorber, and hence 
retarded solutions are self-consistent. We note in passing that the conformal 
transformation t —r limits the accessible region for retarded waves to 
r < H 1 . This is the event horizon of the steady-state universe. 


Collisional Damping. In collisional damping, the absorber particle loses its 
energy by collisions with other particles. If v is the collision frequency, the 
equation of motion of the absorber particle in the direction of the electric 
field & is given by 


mx + mvx = eS e (63) 

We note that the collisional damping term depends on i, whereas the 
radiative damping term depended on x. Since co 0 in the infinite future 
and |jc|/|ic | oc co~ 2 , the collisional damping term may well be more important 
than the radiative damping term, and we must therefore consider it as well. 

In the limit co —0, we have co < v, and it becomes meaningless to seek 
an oscillatory solution, since the particle loses its energy by collisions before 
any oscillation can take place. We therefore argue in the following way. 
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Neglecting time-variation of &, between collisions the particle acquires a 
velocity 


eS 

x ~ -. 

mv 



The kinetic energy of N particles is therefore 


1 6> 2 <F 2 

2 mv 2 




The electrical energy of the incident wave is T — & 2 /%tt per unit proper 
volume. Hence the attenuation is given by 


dT ^ AirNe 2 T 
dz mv 



in which TV is now the particle density, and where dz is an element of proper 
distance directed radially outwards. To obtain (66), one simply notes that 
the energy (65) is abstracted from the wave in a time v~ l , and that in this 
time the wave travels a proper distance dz = v~~ l . Using (44), we get 

dT 47TNe 2 y rji ( sn x 

—— ~-exp { * T. (67) 

dr mv 

The absorption integral is therefore of the form 

- f^^-eUr. ( 68 ) 

J mv 


For the steady state model, N = constant, v = constant, and the integral 
(68) is similar to (62) and therefore leads to the same conclusion. For the 
Einstein-de Sitter model, v may well behave in a complicated way because 
the velocities of the particles in the medium are dependent on r. However, 
the proportionality 


v oc N~ l (69) 

introduces strong convergence in (68), since N oc e~ 3 ^. Hence we expect (68) 
to remain convergent, and for the previous conclusions to remain unaltered. 
This question is further discussed in Chapter 9. 

As in the retarded case, we can do similar calculations for the advanced 
solutions. The results are summarized in Table 3.1. We should point out, 
however, that these calculations are based on an extrapolation of classical 
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physics to co —oo, with the added assumption that absorption cross sections 
saturate at finite values. These calculations are not as soundly based as those 

for retarded waves, since they ignore quantum effects, which are likely to 

be important. 

£ 

We can also perform similar calculations in the ^ = ±1 Robertson-Walker 

spaces, since they are also conformally flat. Such calculations have been 
made by Roe, and his conclusions are included in Table 3.1. 


TABLE 3.1 

Absorber properties of cosmological models. 


Cosmological 

model 

Future 

absorber 

Past 

absorber 

Electromagnetic 

propagation 

Minkowski 

Friedmann 
(k = 0, k = -1) 

Perfect 

Imperfect 

Perfect 

Perfect 

Ambiguous 

Advanced 

Friedmann 
{k = 4* 1) 

Perfect 

Perfect 

Ambiguous 

Steady state 

Perfect 

Imperfect 

Retarded 


§3.4. CONCLUSIONS AND RESTATEMENT 

OF PROCEDURE 

We have just seen that the Friedmann models with k = 0, — 1, do not give 
a perfect absorber along the future light cone, and hence are unsuitable 
for obtaining the empirically observed electromagnetic arrow of time within 
the framework of the Fokker action and of the Wheeler-Feynman theory. 
Moreover, models with k — +1 lead to the same ambiguous situation as 
did a static universe. It follows that, unless we abandon the present line 
of development and return to the unsatisfactory ad hoc choice of retarded 
potentials, and the unsatisfactory position concerning the self-force on a 
charged particle, we must discard the Friedmann models. 

The steady-state model, on the other hand, meets the necessary conditions. 
Retarded potentials are self-consistent but advanced potentials are not; so 
this model is free from ambiguity. However, we cannot conclude from this 
success that the steady-state theory is correct, because there may be other 
models, not considered above, which also meet the necessary conditions. 
We shall in fact encounter another such model, different from any which 
has hitherto been considered by cosmologists, in Chapter 9. 

It will be useful by way of concluding this chapter to restate the procedure 
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which has been followed. Now that the details of the various calculations 
are out of the way, it is possible to see the basic ideas more clearly. The 
electromagnetic field is formally symmetric with respect to advanced and 
retarded fields, and is given by 


S [^ (0, ret + ^advl- 


(70) 


a 


If the particle paths were known, (70) could be calculated from (3) and (4). 
But the particle paths cannot be specified independently of each other, since 
they are interrelated through (10). Because we have no a priori knowledge 
of these interrelations, we are in the awkward situation that until we know 
the paths we cannot calculate the fields of the particles, and until we know 
the fields we cannot calculate the paths. To break this deadlock, we make 
an assumption designed to permit the interrelations of the particles to be 
disentangled. We then proceed to calculate the paths and the fields, and 
we check to see whether the result is consistent with the initial assumption. 
If it is, a self-consistent cycle of argument has been established—we have 
a possible way in which the universe can behave. If several such cycles can 
be found, then we have several possible modes of behavior for the universe. 




universe should adopt—there would be ambiguity. But if we should find 


one consistent cycle of argument, and no others, we should have determined 


the only possible behavior of the universe. This is the situation we hope 


to arrive at. 


We have been especially interested in an initial assumption which states: 
“The interrelation of particles b a with the motion of any particle a is 


to be calculated using the fully retarded field of a” We hope to find this 
assumption self-consistent, because it agrees with experience. And we hope 
to find the corresponding assumption for advanced fields to be inconsistent. 
This distinction between retarded and advanced fields turns out to be related 
to the expansion of the universe and to the cosmological model. Hence the 
theoretical development is capable not only of resolving unsatisfactory 
features of Maxwell’s theory, but also of giving information which narrows 
the range of acceptable cosmological models. 




THE QUANTUM RESPONSE 

OF THE UNIVERSE 


§4.1. QUANTIZATION OF DIRECT-PARTICLE THEORIES 

(LECTURE 7) 

We have so far proceeded along classical lines. We have shown that the 
direct-particle approach to electromagnetism works at least as well as the 
Maxwell field approach in explaining all the classical phenomena of the 
interaction of charges, and that it links with cosmology in an interesting 
way. The choice of retarded potentials is not an ad hoc choice, but is dictated 
by the universe in the large. Moreover, the unbounded motions of charges 
moving under the self-force do not arise in this theory. 

Although success in the classical domain is necessary for any physical 
theory, it is not sufficient. Nature, as we understand it today, is quantum 
in character. In electrodynamics, quantum theory has unearthed a vast 
collection of phenomena outside the concepts of classical physics. These have 
been explained with remarkable success by the quantized version of Max¬ 
well’s theory, although there have been conceptual and mathematical stum¬ 
bling blocks too. Can the direct-particle theory do as well here, if not better? 

At first sight, an attempt to extend the classical direct-particle theory to 
include quantum phenomena seems unlikely to succeed. In Maxwell’s theory 
we have fields to quantize. The degrees of freedom of these fields result 
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in packets of energy called “photons,” which play such an important part 
in quantum electrodynamics. We have no analogous degrees of freedom in 
direct-particle fields. Thus photons do not appear to exist in the latter theory. 
The only degrees of freedom are those vested in the particles. Can we get 
all the conventional quantum electrodynamics by a first quantization alone? 
If not, the theory fails. If we can, however, the theory must be regarded 
as the superior theory, because it reproduces all observations under fewer 
degrees of freedom. 

Other difficulties can be anticipated concerning particles and antiparticles. 
In classical theory all world lines are endless and timelike. In relativistic 

J 

quantum electrodynamics, the world lines can go forward and backward 
in time. What happens to the “identity” of a world line under these circum¬ 
stances? How does the rule of no self-interaction operate under such condi¬ 
tions? 

These are some of the problems which arise when we undertake to 
quantize the Fokker theory. We shall proceed by stages in solving them, 
beginning with the simpler nonrelativistic picture and ending with fully 
relativistic interactions of electrons and positrons. 


§4.2. THE PATH-INTEGRAL APPROACH 

TO QUANTUM MECHANICS 

As we said, the only quantities which can be quantized in the direct-particle 
theory are the particle motions. The usual procedure is to start with a 
classical Hamiltonian and introduce operators. This procedure is not helpful 
here, since we have no Hamiltonian to start with. We have the action, the 
Fokker action, to describe the classical theory. The most suitable method 
of quantization is the path-integral approach developed by Feynman in 
1948. This approach has the advantage of using the classical action as its 
starting point, although it has so far been developed for nonrelativistic 
particles only. This restriction we shall remove later. 

The details of the method of path integrals can be found in Feynman 
and Hibbs (1965). Since the techniques are still not widely known, we shall 
g° briefly over some of the main ideas. 


Path Amplitudes 

Suppose a physical system has action S. This is defined in terms of “paths” 
r that the system can follow in coordinate space. Classical physics tells us 
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that not all r are permissible. In general, there is a unique path r from 
a given point P x in coordinate space to a given point P 2 . Writing r o for 
this path, r o is given by the principle of stationary action 

8S = 0 for r = r 0 . (1) 

Earlier, we have used this mysterious prescription, and have noted its 
remarkable success in classical physics. In 1938, Dirac suggested the inter¬ 
esting idea, later developed quantitatively by Feynman, that (1) is the 
consequence of a more general principle operating in quantum mechanics. 
In quantum theory the system is permitted any of the paths r from P x to 
P 2 , but each path has a definite probability amplitude proportional to 

exp (iS/ti), (2) 

where ti is Planck's constant. All amplitudes add, giving a total amplitude 
for the system to go from P x to P 2 . In the classical limit ti —0, and (2) 
oscillates wildly as we move from path to path—with the exception of T () 
where (1) holds. Paths in the neighborhood of r o make a significant contri¬ 
bution in this limit, whereas the contributions from other paths average out 
to zero. Hence the classical principle of stationary action. 

Feynman carried these ideas further by introducing the concept of the 
path integral. He defined the nonrelativistic quantum-mechanical propagator 
K{P 2 , P x ) for the system to go from P x to P 2 by 

K(P 2 , P x ) — ^ (constant)exp(/S//z), t 2 > t v 

r 

= 0, t 2 t x . (3) 

In (3) the action S is computed for each path T, and the sum is over all 
r from P x to P 2 . The constant is a normalization constant. If, as is usual, 
the paths form a continuum, the sum is replaced by an integral 

K(P 2 ; PJ = j exp [iS(r)/hyi>r, t 2 > t v (4) 

The integral is over the continuum of paths and is more complicated than 
the Riemann or the Lebesgue integral, which are summed over sets of points. 
Only limited progress has been made toward giving a rigorous mathematical 
foundation to this concept. Feynman was able, however, to obtain all 
required physical answers by various subtle devices. We shall draw heavily 
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upon these techniques in the subsequent work. The constant in (3) can be 
absorbed in the measure of °Dr . 

Suppose P 1 represents the spacetime point (x v t x ) and P 2 the point (x 2 , t 2 ) 
in the motion of a particle. Let t 2 ^> t v Any path r from P 1 to P 2 will pass 
through some intermediate point P having time coordinate t. Let the spatial 
position of P be x. Since the action functional is additive over paths, we 

can write 


s(r P2Pi ) = s(r P2P ) + s(r PPi ), ( 5 ) 

where the path r p p from P 1 to P 2 is made up of the segment r pp from 

w 1 

P 1 to P and the segment r pp from P to P 2 . Hence 

2 

exp [iS(r P2 p)/h] = exp [/£(/" p 2 p)/fi\ exp[fS(T ' PP Jfil (6) 

The sum over all paths from P 1 to P 2 can be obtained by summing all paths 

r pp and r p p and integrating over the spatial coordinates of P. From (6), 

1 2 

together with an appropriate measure for the path integral (4), we get 

K(x 2 , t 2 ; x v /j) = / K(x 2 , t 2 \ jc, t) K(x, t; x v / x ) d 3 x . (7) 

Together, (4) and (7) suggest an alternative way of defining the path am¬ 
plitude. Suppose we divide r p p into a large number of small segments 

2 1 

with intermediate points Q r (r — 1,. . ., N — 1). We can define O 0 = P x and 
Q n = P 2 . Over each segment ( Q r _ v Q r ) we may imagine S to change very 
slowly. We define the amplitude for such a segment to be proportional to 
K(Q r ; Qr-i)- The amplitude along the entire path is then given by the 
product 


N 

Hm HArKiQ+Q^J, (8) 

N-* OO 

r ~ 1 

where A r is a constant of proportionality with the dimensions of spatial 
volume. Summing over all paths gives 

A r K(Q r - Q r _ tfor =/...} K(P 2 ; Q N _ t ) K(Q N _ x ; Q N _ 2 )... 

ml 

d 3 Qi = K(P 2 ; P i) (9) 



■ ..K(Q 2 > QdK{Q l \P 1 )d 3 Q N _ 1 . 


# • 
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by (7), the integrations of d 3 Q r being over the time sections t — t r \ 
r = 1 ,..., N — 1 . 

The constants A r are again absorbed in the measure of 7\ Because of (7), 
the definition (8) leads to the same function K(P 2 ; P^) as before. 

Sometimes we know K but not S'. Then (8) is useful to define a path 
amplitude. The original definition (2) is the more direct one, however. We 
shall use (8) in relativistic path-integral theory. 


The Wavefunction. 

Suppose that, instead of knowing that the particle is at (jc 19 t A ), we only know 
the probability amplitude \p(x v r r ) for it to be at various spatial positions 
x x on the time section t = t v We then ask, what is the probability amplitude 
c p(x 2 , t 2 ) of finding the particle at (jc 2 , t 2 )l This is obtained by the weighted 
mean of K(x 2 , t 2 \ x v t x ), 

<t>{x 2 , t 2 ) = J K(x 2 , t 2 ; x v t x ) \p(x v t x ) d 3 x v (10) 

As t 2 —/j, \p. This implies 

lim K(x 2 , t 2 \ x v t A ) — 8 3 (x 2 - xj. (11) 

t 2 -*t 1 

The function 4> is the usual Schrodinger wavefunction. Does it satisfy the 
Schrodinger equation? The answer is “yes,” and we illustrate this by a simple 
example. For a particle moving in a potential field V, we have 

S=((±mi>-v)d,. ( 12 ) 

If we substitute this in (4) and use (10), we find that <f> satisfies the differential 
equation 

ti 2 d<p /1 o \ 

F +¥<}> = ih ^. ( 13 ) 

2m ot 2 

This remarkable result is proved in Appendix 2, p. 216. In view of (11), 

and the fact that K = 0 for /., < /, (no propagation backwards in time), 
(13) implies that K satisfies the equation 

F., 2 + 1 y) K(x 2 , t 2 ; Jf„ tj) = S 3 (x 2 - -v,) 8(t 2 - fj). (14) 

\ dto 2 m “ n 1 
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Transition Probability 

Let the particle be in an initial state ^(x, t x ). We wish to determine the 
probability that it is in state <j> f (x , t 2 ) for t 2 > t v Using (10), the amplitude 
is given by 

I — // <pf*(x 2 , t 2 ) K(x 2 , t 2 ; x v tj) <#>{( x v t x ) d 3 x 2 d 3 x 1 

-iff <P f *(x 2 , t 2 ) exp [iS{r 2l )/h) 4> t (x j, t^dT. 21 d z x 2 d 3 x v (15) 

where * denotes complex conjugate. 

The transition probability from 4> i to <j> f is given by the square of the 
modulus of (15), i.e., by 

4>f) = K<#vl<f>i>| 2 

-If.If if Pf ( x 2> ^2) €Xp[/S(/T>i)/^] ( t > i( x i’> ^1) 

• 4>*(x' v u) exp [-iS(r' 21 /h)](f> f (x' 2 , t 2 ) 

• ®r 21 ®r’ 21 d 3 x x d 3 x 2 d 3 x j d 3 x 2 . (16) 

We shall often refer to T' 21 as a conjugate path, implying that the action 
along r' 2 i is multiplied by —i instead of + /. 


Perturbation Theory 

We now consider the problem of perturbation expansion from the path- 
integral point of view. Suppose we know the behavior of a system with action 
S 0 . Let the system be disturbed by a potential V(x, t) over the time interval 
(t v t 2 ). The action during this interval is therefore 

S’ = S 0 - f 2 V(x, t) dt. (17) 

J h 


We now wish to determine K v (x 2 , t 2 \ x v / x ) in terms of the unperturbed 
propagator K 0 (x 2 , t 2 , x v ^). We have 


K 0 (x 2 , t 2 ; x v M 


f a 

^0(^21) 

exp 

L n J 


C\ 


vr 


21 


(18) 
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Expanding the exponential involving the integral over K then 


^v( x 2> ^ 2 > *^ 1 » ^1) 



exp 


^0(^21) 

ti 






The path integral for the unity term in this expansion gives A',,. We consider 
the next term 



The integral f Fn V dt is over a specific path T 21 , given by a function x(t). 

Suppose we take a particular instant t of time and take all paths which pass 
through x(t) on their way from P 1 to P 2 . If we sum over these paths alone, 
we will get, as in the analysis leading to (7), the product 


K 0 (x 2 , t 2 \ x , t)K 0 (x, t\ x v tj. 



If the integral over V were absent, we would simply have got (7). But now 

/ F(x, t ) 

we have to weight (22) with--2—. Then we have to sum over all x, 

n 


to include all paths from 1 to 2. Finally we perform the time-integral to 
get the entire contribution: 


1 

h 


to 


/ K 0 (x 2 , t 2 , X , t) V(x, t)K 0 (x , t; x v t x ) d 3 x dt. 

J t , 


(23) 


The physical meaning of this operation is illustrated in Figure 4.1. If we 
consider a given t and x, we imagine the system to proceed undisturbed 
from (x v t x ) to (x, t). Then it is scattered by V(x, /), after which it proceeds 
undisturbed from ( jc , t) to ( x 2 , t 2 ). This scattering could occur anywhere 
within the spacetime slab t x < / < t 2 . Hence the integration in (23). The 
four-dimensional volume element, d 3 x dt = dr, say, can actually be taken 
over the whole of spacetime, since one of the K 0 functions involves propa¬ 
gation backward in time. The integrand is therefore zero whenever the point 
x, t, falls outside the slab t x < t < t 2 . 

Higher-order terms in the expansion give multiple scattering processes. 
Thus (19) is the closed form of the usual infinite perturbation series: 
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A^(aC 2 , ^2’ X V C) — Kq(X2, t 2 \ X /j) 




Transition Element 

Classically we are used to continuous changes of dynamical variables. In 
quantum mechanics, transitions are in general discontinuous. Can we give 
meaning to “velocity” or “acceleration” in discontinuous transitions? In the 
path-integral formulation, we can indeed give a meaning to such terms, by 
means of the concept of transition elements. Suppose and <$> f are the initial 

action S. We have already seen 
that the probability amplitude can be defined by the path integral 

<<^|<^> = fft <p* exp (iS/fyfyfdr d 3 x 1 d s x 2 . (25) 


and final states of a system described by 


Suppose we now have a functional F[.T] of a path f. Then the transition 



V(x, t) 


FIGURE 4.1 

Scattering due to a potential field K(jc, t). 




§4 2. THE PATH-INTEGRAL APPROACH TO QUANTUM MECHANICS 


57 


element of F is given by 



This definition means that <4> f \F |<|>;> is a certain kind of average over all 
paths suitably weighted by the initial and final wavefunctions. Classically, 
we could have calculated T[r] exactly, since we know that r = r o is the 
unique solution. In quantum mechanics, we cannot make such a definitive 
statement. 

Equation (26) is easy to apply in simple cases. For instance, for a free 
particle of mass m , the transition element of velocity is given by 



(27) 


As seen in this example, a transition element need not be real even if the 
original dynamical variable is real. 

Influence Functional 

We come now to the concept most useful for the quantum development 
of direct-particle theories. Suppose we have one quantum-mechanical system 
in interaction with another, and suppose we are interested only in the 
detailed behavior of the first system regardless of what happens to the 
second. We may speak ot the second system as the external environment 
of the first. To fix ideas, let the first system be described by a coordinate q 
and the second by Q. The combined action tor the two systems is taken 

to be of the form 


(28) 


£ — S 0 [q(t)] + S E [Q{t)\ + Sj[q(t), Q{t)], 


where S 0 represents the action of the first system alone, S E is the action 
of the second system alone, and Sj represents the interaction of the two 
systems. Let <#>*(< 7 ^ t t ) be the initial state and t,) the final state ot the 
first system. Using (16), the probability of transition <#>* <f>, is given by 








58 


4. THE QUANTUM RESPONSE OF THE UNIVERSE 


where 


F[q{<), qV)] 


f 


l 


exp | — S,[q(t), Q(t )] 





{{s,[q'(tl 0V)]} 




Here \p i is the initial state of the second system. We have summed over all 
final states \p f , since we are not interested in how the environment ends up. 

F[q(t\ q'(t)\ is called the influence functional and represents the "force” 
exerted by the environment on the first system. 

Feynman and Hibbs discuss influence functionals and their applications 
in some detail; so we shall not discuss them further. We expect the universe 
to act as the external environment in the quantum version of the Wheeler- 
Feynman theory. It is our aim to determine the quantum analogue of the 
response of the universe that earlier we calculated classically. This response 
will appear in the form of an influence functional. 


§4.3. ATOMIC TRANSITIONS (LECTURE 8) 

We now discuss how the Wheeler-Feynman theory operates in nonrelativistic 
quantum mechanics by considering a typical example—that of atomic 
transitions. Consider an atomic electron making transitions between states 
of energy E n and E m . If E m > E n , we know that the electron jumps "down” 
from the state of energy E m to the state of energy E n even if no external 
field is present. Such jumps are called spontaneous transitions to distinguish 
them from induced transitions caused by ambient electromagnetic fields. 
The latter are symmetric with respect to any pair of states; i.e., the proba¬ 
bility of jumping down is the same as that for jumping up. In Maxwell field 
theory, the spontaneous transitions are explained by means of field quanti¬ 
zation in which fluctuations of the vacuum (no field) state play a crucial 
part. It would seem, therefore, that here at last we have a phenomenon which 
really depends on the “field” character of electromagnetism. How can the 
direct-particle theory explain the downward transition of an electron when 
apparently there is no electromagnetic disturbance present? 

The clue to the answer to this question is found in the classical formula 

for a universe that is completely absorbing along the future light cone. We 
have 
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In (31) the first term on the righthand side shows how other charges b 
influence the motion of the charge a via retarded fields. This term accounts 
for the induced transitions of ci. The usual results for upward and downward 
transitions can be recovered from it. The second term, on the other hand, 
appears to concern charge a alone, and would be present even if the first 

term were zero. It is to this second term that we must look for spontaneous 
transitions. 

We have seen how the second term arises. It is the response of the rest 
of the universe to the retarded field of a. This response arises instantaneously 
with the outward movement of a disturbance from a . In the quantum theory 
of atomic transitions, we expect the following chain of events: an electron 
a initially in an upper-energy state could stay there, or it could jump up 
or jump down. Quantum-mechanically, we must consider all possible mo¬ 
tions of a , not just a unique classical one. Each motion would invoke a 
response analogous to (31). We then ask whether this response is consistent 
with the over-all transition of a , and if it is, we calculate the probability 
of transition. It is the nature of the response from the universe which will 
decide whether there should be transition at all, and, if so, with what 
probability. Thus the apparently local process of spontaneous transition will 
turn out to depend on the large-scale structure of the universe. 

To fix ideas, let us consider the motion of a in the time-interval 0 < t < T 
(see Figure 4.2), and denote the displacement of a by a(t). Let b be a typical 
absorber particle whose world line is intersected in intervals A_ and A + , 
respectively, by the past and future light cones from the initial point [a(0), 0] 
and the final point [ a(T ), T] on the path a(t) of a. From what has been 
said above, the induced transitions of a arise from its interaction with the 
retarded field of b , i.e., from the portion A_ of the world line of b. The action 
governing induced transitions is therefore 


T 



(32) 


where A ret (b) is the full retarded 3-potential from b. To calculate spontaneous 
transitions, we need, on the other hand, the transitions of b induced by the 
full retarded field of a . The action governing this is 


; 4 ret (a) (£) * bdt. 


( 33 ) 
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Response for spontaneous 
transitions of a 


± Source of induced 
transitions of a 


FIGURE 4.2 

The induced transitions over the time-interval 0 < t < T ot the 
particle a arise from interaction with the past segment A_ of the 
path of particle b, whereas spontaneous transitions of a arise from 
interaction with the future segment A + . 


with a similar notation. As in the classical case, we are looking for a self- 
consistent cycle of argument in which the net field is the retarded one. 
Throughout this calculation, we shall work in a conformally flat cosmological 
model with a perfect future absorber. 

Our aim is to calculate the influence functional governing the motion of 
a and arising from the whole universe. Since we may assume the different 
absorber particles to act independently, the influence functional has the form 

F[a(t), a '(/)] = ]T ^>(0, «m (34) 

by£ a 


where F {b) [a(t), a'(t)\ is the influence functional exerted by a typical par¬ 
ticle b. 

To calculate F (6) [a(/), a'(t)], we consider all transitions produced by (33) 
in the absorber particle b. If ipj(lf) is the initial wavefunction and \p f (b) any 
final wavefunction of b , we have from (30) 
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F^[a{t\ a\t)] 


sffff d % d3b f d % d3/ >'n (35) 

f 


where 


y <w = j j exp ±{S E [b(t)] - S*P'(0] + S,W0. b(t)} 

- S^a'U), b'(t)}) • ®b®b'. (36) 
With the interaction governed by (33), we write 


S 7 K4*(0] = -e b J Aj«\b)-bdt (37) 

in (36), with a similar expression for Sj[a'(t), b'(t)]. 

Because of the cooling produced by the expansion of the universe, b is 
usually in its ground state. We shall take \p i to be the ground state. We can 
expand the part of the exponential in (36) containing the interaction terms 
only. The unity term in the expansion corresponds to Thus we 

have 


F ib) [a, a') = 1 + V fff f ff ^(bfWfib'^'ib'^ibi) 

f?i 


1 


n 2 



e b I A T J a \b)-bdt 



A re t a \b') * b' dt 


+ 


+ 


• exp 


(t ( S e [*(')] - 5 el /, '( / )l}) d 3 b f d 'h l d 'b\ d 3 b' f 


+ terms 


m [/, I u. 


+ . . . , 


(38) 


where A ret (a) is calculated for the path «(/), A r J al for the conjugate path 
a'(t). 

Let E, and E i be the energies of states i f and tp { , respectively. Then 
first-order perturbation theory shows that the contribution to the above 
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expression from the transition xp i — \p f is 





where 




~ i(E f - E t )t ' 
h 


dt f^(b)Aj a \b)-b^(b)d 3 b. 



To calculate (40), we proceed as follows. Take an origin near a to measure 
the displacement a(t). Similarly, let b(t) measure the displacement from an 
origin near b. Let the origin near b have a relative displacement R with 
respect to the origin near a. Then the vector from a(t) to b(t) is given by 

r — R + b — a. (41) 


For \R\ > |/>|, \a\, we get 




In the absence of any dispersion, we would have, in the Coulomb gauge. 



where a ij) are two unit vectors which form a mutually orthogonal triad with 
r/\r\. 

The quantity r — r- a varies only slightly with />, and it is sufficiently 
accurate to replace it in (43) by R — R • a. 





We Fourier-analyze (44) to get 





where 
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a f a / 277 / 7 /'\ 

Rr)T-a-R/R exp l— r 


The range T' of /' is given by 


t' — t + R + ~ /?•(/>_ a ), 

K 


0 < t < T. 


Since b , R, are not to be regarded as varying with /, we have 


dt' 

dt 


1 


aR 

R 


so that changing the variable from /' to t in (46) gives 


A 


a 


RT 


7 exp 


277/7 / /? • b 

-( R H- 

L T' \ 


R 


T 

f - 


exp 


277/7 / 

T r \ / 


a • /? 


( 46 ) 


(47) 


(48) 


dt, (49) 


with 


r = r - |>(r) - «(0)]. 



The effect of dispersion in the cosmological medium is to introduce both 
a phase change and damping into (49), modifying it to 



the phase change being expressed by Xi an d the damping by 

We now substitute (45) into (40). We shall make use of the fact (to be 
shown later) that Xi i s a ver y large phase angle, and use this to wipe out 
any cross products of A x , A v which arise in (39), except those with l' — —l, 
giving 



( 52 ) 
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in which we have already averaged with respect to all orientations of b to 
remove product terms in the components of a (1) and a (2) , and where 


A l 2 * 1 

Mt 


~j is the matrix element of b exp 


.2711 Rb 


i 


T R 


with respect to 


\f/. and \p f . We have also ignored variations of b over A + in comparison 
with T'. Expression (52) can be simplified further by the results 



T' 


exp 


o 


U \ E f 
ti ' / 


277 til 

E: - dt' 

l yv 



lirT'ti S I E, - £. 


GO 


2 
l — —oo 


sin 2 a 


(77/ — a ) 


f 


1. 


IttITi \ 

-jr) < 53 ) 


(54) 


Writing 


tik = E f — E { , k 


kR 

R 


(55) 


and using (53) and (54), we reduce (52) to the form 


e -Hk^b if (k)\2 2 f a ^.ae~ ik - a + m dt 

i= i,2 J o 


3 R 2 h 2 


T 

X I a (j) • d'e ik ’ a '~ ikt dt — X (say). (56) 
J o 


Suppose that at a distance R in a particular solid angle d£2 , there are 
n(k)dk particles per unit volume with states f and i such that (E f — E^/ti 
lies in the range k to k + dk. Writing \b if (k)\ 2 for the average of \b if (k)\ 2 
for all systems satisfying this requirement, the contribution to F[a , a'] from 
all absorbers between R and R + dR and in dJ2 is 


|q _|_ ^ yi(k) dk R- 1 dR <U2 


(57) 


Since X is very small and the index is large, we can rewrite (57) in the form 


exp[2f • n(k) dk R 2 dR dfi]. 


(58) 


The function r(k) in (56 ) is just the optical depth of the absorbing medium 
at frequency k. It is not hard to show that 


dr(k) 

dR 


4t7 2 e 2 k 

V 


b if (k)\-n(k), 


(59) 
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the damping expressed by t (k) being due to induced upward transitions 
in the absorber. Thus (59) follows from first-order perturbation theory. 
Remembering that absorber particles contribute as a product, as in (34), 
we next integrate the exponent of (58) with respect to R and with respect 
to k. Using (59), and letting t —oo as R oo, we obtain 



It can be verified, again without difficulty, that the part (60) of the influ¬ 
ence functional leads to the usual spontaneous transition formula. Also, since 
the integration is over positive values of k only, we do not have spontaneous 
upward transitions. 

To complete the present problem, we still have to consider the terms in 
(38) involving [ fj J 2 and [ f J+ ] 2 . After an analysis similar to that given above, 
we finally get 

F[a, a'] = exp / dS2 f k elk 



J (ol^ • a)e xka lkt dt f (a^ • d)e lk ' a+lkt dt 


T t 

J (ol^ • a')e~ xk ' a + xkt dt f (ol/c^ * d')e lk ' a '~ ikt dt 


. ( 61 ) 


Here we have also integrated with respect to solid angle and have added 
the subscript k to a (;) to remind us that a (i) is a unit vector perpendicular 
to k. 

It is possible to arrive at exactly the same influence functional by a 
path-integral quantization of Maxwell fields. The present theory therefore 
is equivalent to the Maxwell theory in its quantum description of electro¬ 
magnetic phenomena that arise from nonrelativistic motions of charged 
particles. Just as in the classical case, the parameters that characterize the 
absorbing medium do not appear in the final result. The terms in (61) 
additional to (60) describe level shifts. 
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§4.4. THE ROLE OF COSMOLOGY (LECTURE 9) 

The expansion of the universe was not explicitly mentioned in the above 
analysis, although we used the important cosmological property of complete 
absorption in the future, by letting r —oo as R —oo. We now point 
out some of the cosmological issues involved in the calculation. To fix ideas, 
we choose the steady-state model as an example, since it satisfies the neces¬ 
sary classical requirement of perfect absorption in the future and of imper¬ 
fect absorption in the past. We shall use the conformal transformations 
described in the classical discussion, except that the time coordinates t and 
t will be interchanged to keep the same notation as in §4.3. Thus the 
Robertson-Walker line element will be written as 

ds 2 = dr 2 — e 2Hr [dr 2 + r 2 (d0 2 4- sin 2 0 d<j> 2 )], (62) 

and its conformally flat form as 

[dt 2 — dr 2 — r 2 (d0 2 + sin 2 6 d<j> 2 )]. (63) 

Because of the conformal invariance of electromagnetism, we used (r, 6, <f>, t) 
coordinates in §4.3, just as if we were working in flat space. 

In the spontaneous transition E m E n of the local system, the impor¬ 
tant frequencies involved in the influence functional (61) are those in the 
neighborhood of 



k — (E m - E n )/fi. (64) 

This frequency has to be matched by the value of (E f — E-)/ft for the 
absorber transitions xp t xp f . If, however, we wish to consider E t , E f , with 
respect to proper time at the absorber, it is necessary to take account of 
the redshift effect of the expansion of the universe. Thus when a wave, 
starting with frequency k at the source, reaches an absorber at coordinate r, 
the proper frequency has become 

<o = (1 — Hr)k, (65) 

and E it E f , with respect to local proper time are related to k by 

Ef — E^ — hk{ 1 — Hr). (66) 

If we displace the absorber by a proper distance dl away from the source, 
the frequency in tune with the wave changes by dto, where 
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dto = -Hk dr, dr = (1 - Hr) dl (67) 

The second relation in (67) follows from the conformal transformation. 
Hence we get 


— = — H dl. (68) 

CO 

Suppose the absorbers are effective over a range [co min , co max ] of frequencies. 
Then if k > co max , the range of / over which the absorbers can be in tune 
with k is 




H-1 ln . 

CO * 
mm 



For complete absorption, we need / —oo. Hence we need a process with 
comin 0. Collisional absorption provides such a process. 

The classical attenuation for a wave of frequency co over a length dl caused 
by collisions with frequency v is 



mco 2 



where N is the number density of electrons, e the electronic charge, and 
m the electronic mass. Using (65) and (67), the damping produced over the 
coordinate range 0 < r < R is 


exp 


iTrvNe 2 , 

r R dr 

mk 2 ^ 

(1 - Hr) 3 J 


For large R , appreciable damping occurs when 




l/rrvNer_ 

mk 2 H 




which corresponds to an effective proper frequency 

2 lirvNe 2 

ef£ mH 



For ionized hydrogen, v at a> e(f is given by 


















68 


4. THE QUANTUM RESPONSE OF THE UNIVERSE 


where y is a typical electron velocity. Taking H~ l ~~ 3 • 10 17 sec, 
N — 10 - 5 cm -3 , v ~ 3 ^ (of velocity of light), and substituting for e , m, k, 
in (73) and (74), we can solve for co eff and v\ 

co eff ~ 10 6 sec -1 , v ~ 10 -10 sec -1 . (75) 

Thus a wave with frequency greater than 10 6 sec -1 is first redshifted to 
10 6 sec -1 and then absorbed. The effective absorption takes place over a 
distance of the order of 10 28 cm. A wave with frequency less than 10 6 sec -1 
is absorbed without having to be redshifted. 

From (75) we find that the dimensionless parameter 


477 Ne 2 

mco* tf 



Since the real part of refractive index is given by 

1 2irNe 2 

n — \ -o » 

mco eff 




we see that 1 — n ~ 10 -7 . 

Suppose we take two Fourier components k and k + Ak given by neigh¬ 
boring values of / in the analysis of §4.3, 


k 


2ttI 


k + Ak 


27 j(1 + 1 ) 

T 


(78) 


These are in tune with absorber frequencies co, to + Ao, where 


Aco _ Ak 

co k 


1 

/ 


(79) 


The difference in n for these two neighboring frequencies is 


An = *lNel A( . 


10 


mco 


l 


(80) 


when co co eff . Over a distance H 1 this causes a phase difference 


Amo ei{ H 1 


\0~ 7 co^H 


eff 



277 

~kV 


(81) 


For an allowed transition, T'k does not exceed unity by a factor of 
more than ( e 2 /h )~ 3 . Thus the phase difference is greater than 
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— 10 12 co eff // 10 11 radians. This explains why even neighboring fre¬ 
quencies become randomly phased, and why we were justified in omitting 
crossproducts in (39). Thus random phasing is of cosmological origin, and 
does not have to be assumed ad hoc as in the field theory. 

These numbers could change somewhat if other cosmological models with 
appropriate absorption properties were used, but the basic ideas would 
remain the same. 


§4.5. THE PHOTON AND THE THERMODYNAMIC 

ARROW OF TIME 


How does the photon fit into the above picture? First, we note that when 
an atomic electron jumps down, it loses energy E m — E n — hk. The energy 
is not emitted classically in an isotropic way, but as the influence functional 
(61) shows, it is emitted in a certain direction in a solid angle dti with a 
certain probability. We may describe this exchange of energy between the 
source and an absorber in dQ as an exchange of a photon of energy hk. 

Suppose further that there is an external direct-particle field present, as 
given by the first term on the righthand side of (31). The electron then jumps 
up and down between the states with energies E n , E m with probabilities 
which satisfy 

P(E rn E n ) = P(E n E m ) + probability of spontaneous 

transition E m — E n . 


Now we can write 


P(E m -+E n ) «+l 

T77F-FT -- (° z ) 

P&n E rn ) n 

where n is some number. Since the induced transition probabilities are 
proportional to the intensity of the incident field, we can say that the 
intensity is that of n photons. 

In this definition of the “number of photons” in the external field, we 
use the spontaneous transition probability as an absolute unit. Indeed, this 
definition of the photon is physically more realistic than the abstract one 
given by the quantization rules of field theory. Physically, a photon is seen 
either when it is emitted or absorbed—i.e., when it begins or ends on a 
particle world line, and the process of measurement of its energy causes 





70 


4 THE QUANTUM RESPONSE OF THE UNIVERSE 


transitions in the measuring equipment. It is this effect that we are using 
here to define the photon. 

Equation (82) also gives us a clue to the connection between the arrows 
of time in cosmology and thermodynamics. As is well known, (82) leads 
to the blackbody spectrum of thermodynamics; yet (82) arises here from 
a cosmological asymmetry which permits only downward spontaneous 
transitions. If the cosmological medium could act as a maser, augmenting 
the outgoing wave systematically instead of absorbing it, we could achieve 
spontaneous upward transitions. It may be possible to formulate a consistent 
theory with spontaneous upward transitions in a contracting universe which 
gets hotter in the future. In that case one would expect “population inver¬ 
sion” and thermodynamics going “backwards.” Further work is necessary 
to investigate this possibility. If successful we could argue that there is a 
strong interrelation of the time arrows of cosmology, thermodynamics, and 
electrodynamics. 

This completes the nonrelativistic discussion. We now turn to the inter¬ 
actions of relativistic spin-half particles. We shall put fi = 1 from now on, 
thereby eliminating a redundant unit. 


§4.6. PATH AMPLITUDES FOR DIRAC PARTICLES 

(LECTURE 10) 

There are two ways of approaching the problem of defining path amplitudes 
for relativistic spin-half particles. The first one, making use of the Feynman 
method based on (4), presents difficulties at first, since we have no clear 
way of writing S for such particles. The formula 

S = — f m a da (83) 


is not suitable in itself, since it does not contain spin. We shall consider 
this problem later. 

The second approach consists of extending (8), using a suitable propa¬ 
gator. Direct analogy with the nonrelativistic case suggests that we take 
K(2\ 1) as the solution of the inhomogeneous Dirac equation 



+ im 


K( 2 ; 1 ) 


= « 4 (2, 1), 



which satisfies the boundary condition 
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K( 2; 1) = 0 for / 2 < t v (85) 

Using the free-particle solutions u n of the Dirac equation, we can write K 
explicitly as 


Ay ( 2 ; 1) = 0(t, 


t 


l) 2 "n( 2 KO)- 


n 



where 0 is the heaviside function and u n = u* y 4 . We shall also use the 
notation 


7% (87) 

The sum ( 86 ) is over all energy states, negative as well as positive. This 
poses the usual difficulty of particles with negative energy states. Feynman, 
who used A 0 + (2; 1) in defining path amplitudes, had to postulate a vacuum 
filled with negative-energy particles. Thus even to describe the motion of 
a single particle using ( 86 ), we have to take into account a background with 
an infinity of negative-energy particles. 

To avoid this situation, we proceed in a different way. We introduce two 

types of particles—those with positive energy always going forward in time, 
and those with negative energy always going backward in time. These may 
subsequently be identified with particles and antiparticles, e.g., electrons and 
positrons. 

We therefore introduce two types of paths, A+ and r~ (see Figure 4.3). 
The r + are always going forward in time, and amplitude along them is 
defined by ( 8 ), using K 0 + : 

N 

P(r + )= lim 1 { A r K 0 + (Q r ; Q r _,). ( 88 ) 

N~*°° , 

r — 1 

The r~ paths are always going backward in time, with amplitudes 

N 

P(r-) = lirn Y[A r K 0 -(Qr-,Qr-i)> ( 89 ) 

r zz 1 

where the points Q r form a reversed chronological sequence, and K 0 ~ is 
given by 


K 0 ~( 2; 1 ) = -6(t 1 - t 2 ) V u n (2)u n (\). 


( 90 ) 





72 




r - 7 

r Q r 

( FIGURE 4.3 

• Q r + 1 Forward and backward paths. 


P-2 

We make a similar distinction for wave functions. Suppose \p(x, /) is given 
on any spacelike surface. We then write 

Xp(x, t) = \p + (x, t) + Xp_(x, /), (9 1) 

where is made up of positive energy states only and \p_ is made up of 
negative energy states. The ip + part is propagated forward by the r + paths 
while the part is propagated backward by r~ paths. 

Suppose we have a slab of spacetime defined by t x < t < t 2 . Let \p + be 
given on t = t x and if/_ on t = t 2 . Then using arguments similar to those 
in the nonrelativistic path-integral theory, we get \p(x, t) as 

>P(x, t) = J K 0 + (x, t; x v ^ 1 )y 4 ^ + (.v 1 , tj d 3 x 1 

— J K 0 ~(x, /; x 2 , l 2 )y 4 t[/_(x.,, r 2 ) d 3 x 2 . (92) 

The y 4 factor arises from the necessity to preserve invariance. Equation (92) 
can be rewritten in the form 

yp(x, I) = J K + (x, /; Xj, ? 1 )y 4 ^(at 1 , /j) d 3 x l 

— J K + (x, t; x 2 , t 2 )y 4 xp(x 2 , In) d'-Xn, (93) 
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where we no longer restrict \p to the positive energy component on t = t 1 

and to the negative energy component on / = t 2 , but dehne a new propa¬ 
gator 


s 


KA2, 1 ) 


y “n( 2 K(*)» ‘ 2 >‘v 


E„>0 


y u n( 2 )“,(')> t 2 < t V 


«»<<> 



Thus in the first integral on the righthand side of (93), only the positive- 
energy part ip + is propagated to ( x , /). Similarly, only the negative-energy 
part is propagated to (x, t) from (x 2 , t 2 ). 

To summarize, the total amplitude at a point within the slab t x < t < t 2 
is determined by r~ paths from the future as well as by r + paths from 
the past. Since are zero outside the light cones, r ± paths are always 
timelike. We can build up more complicated paths (see Figure 4.4) by joining 
r + and r~ paths in any order. For propagation of a free particle, however, 
such zigzag paths make no contribution, as can be easily verified from the 
above definitions. 

Next we introduce the electromagnetic potential B v The path amplitude 
for a charged particle a is now given for a path by 





FIGURE 4.4 

Path reversals caused by the electromagnetic field A x . If A { were zero at a reversal 
point, the path would make no contribution to the amplitude. 
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P B (r + 21 ) = P(f+ 21 )exp -ieB i da i , (95) 

r»+ 

1 21 

where P(r + 21 ) is the free-particle amplitude along C + 21 from 1 to 2. 
P B (r- 21 ) is similarly defined. 

If we consider the same problem as above, i.e., given ^ on two planes 
t — t x and t = r.> > to calculate \p(x 9 1) at an interior point of the slab 

t 1 < t < t 2 , we get, instead of (93), 

4>(x, t) = | K B + (x, t; x v t 1 )y 4 xp(x 1 , t 4 ) d' i x l 

— f K B + (x, P, x 2 , t 2 )y 4 ip(x 2 , t 2 ) d 3 x 2 , (96) 


where 

K B + (2; 1 ) = K + ( 2 ; 1 ) - ie f K + ( 2 ; 3)tf(3)K + (3; l)dr 3 

+ (~ie) 2 f f K + (2;3)0(3)K + (3;4)0(4)K + (4; \)dT 3 dr 4 + .... (97) 

(This result is derived in Appendix 3.) K B + { 2, 1) satisfies the equation 

at 2 + ie ^ i) = 5 4( 2 ? i). ( 98 ) 


Many-Particle Systems 
For a single particle we write 

K B +(2, 1) = | P(r 21 ) exp y J — ieB i da^Vr 2v (99) 

where F 21 can be made up out of any number of r + and f~ paths joined 
in any order. We shall consider (99) further later. 

For a system of many particles we have a multiple-path integral. To write 
it, we change our notation slightly, denoting the initial and final position 
of the a th particle by a t and a f , respectively. Chronologically, a i may not 
occur before a f , since paths can go backward and forward. A typical path 
joining a { to a f will be denoted by T fl . Then the propagator for a non¬ 
interacting system of particles in a given field B i is formed as a product 
of (99) taken for each particle, 
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K B + [. 


a n a i> 






( 100 ) 


If, on the other hand, the particles are mutually interacting, (100) is changed 
to 



where 



In (102) we have apparently included-self action. We shall return to this 
point in some detail later. 

We note that if we are dealing with similar particles, the exclusion princi¬ 
ple requires the initial wavefunction to be antisymmetric. If the initial 
wavelunction is T[..., a v ...], then the final wavefunction given by 

T[..., a h .. .1 = 


/■/*'* 


+ R) 


+ 


a,;... JIJ f/l a ^ [ • • • > Cl i ,... ] TT"- (* 03 ) 


a 


a 


is also antisymmetric. In (103), each a i runs over a closed three-dimensional 
surface, with normal nj and surface element dS a at a typical point. 

Equation (103) requires us to antisymmetrize the amplitude at each stage. 
Thus (see Figure 4.5) if two electrons start at 1,2, and end at 3, 4, we have 
to calculate amplitudes along paths T 31 , T 42 , multiply them, and then 
subtract the amplitude product for paths r 41 , r 32 . This immediately leads 
to closed-loop paths in perturbation expansions. For example (see Figure 
4.5), if the electron goes from 1 to 3, is scattered back to 2 and then forward 
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3 



3 4 



FIGURE 4.5 





In two-particle scattering, the exclusion principle requires the amplitude for (b) to be 
subtracted from that of (a). The same principle, when applied to multiple scattering of a 
“single” particle by electromagnetic potentials, leads to closed loops, as shown in (c) 
and (d). 


to 4, we must subtract from the amplitude along the path 


r r r 

1 42 1 23 i 



the amplitude along 


r r r 

1 4X 1 23 1 



(104) 


(105) 


In (105) we have a particle moving freely from 1 to 4 and a closed loop 
at which the potential acts twice. 

A closed loop, r° 9 is made up of f ± paths, and its amplitude can be 
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similarly defined. However, since here the ends are identified, the amplitude 
becomes a trace: 


P B (f°) = 


Tr 


n w 




(106) 


Thus P B can be evaluated starting from any point of r°. Equation (106) 
also gives the relation 

P B (r {) ) ~ exp ^ — ie£ B } dlP(r°), (107) 

where dl x is an element of length along r° and e is the charge of the particle 
giving rise to the loop r°. We shall use these results in the discussion of 
vacuum polarization. 

We come now to the status of the self-action term in (102). Once we allow 
paths to go backward or forward, we cannot properly identify two distant 
sections of world lines as belonging to distinct particles. This inability 
requires the 8(s 2 ) interaction to act between all sections. In the nonrelativistic 
theory, all paths are forward-directed and are timelike. The interaction 
8(s 2 ab ) shown in Figure 4.6 therefore vanishes unless A and B coincide. So 
long as we avoid this possibility, classical theory and nonrelativistic quantum 
theory remain unchanged by the inclusion of self-action in (102). 

We therefore make the rule that in the self-action we do not permit the 
two “legs” of the delta function to coincide. We may allow them to approach 
to a certain minimum distance. If we go to smaller separations, we would 
bring in true self-action, i.e., a particle interacting with itself. We shall give 
a quantitative expression to this idea in the next section. 





B 


FIGURE 4.6 

The self-action described by the propagator S(s 2 AB ) is zero when 
the separation of points A, B, is timelike. 
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§4.7. THE QUANTUM RESPONSE OF THE UNIVERSE 

(LECTURE 1 1) 

In the multiple-path integral (101), the electromagnetic phase factor is i times 

- 2 e a f B i dai ~ J 2 2 e a e b J f Wab) da i dt>l - ( 108 > 


For definiteness, we imagine a slab 0 < t < T, taking the paths r a , r b , to 
be segments inside the slab. There are interactions with segments outside 
the slab as well, and these determine B v Thus we can write 

B l = B\ <0 -j- B i t>T , (109) 

where the two terms on the righthand side correspond to effects of the past 
(t < 0) and of the future (t > T), respectively. We have seen that B l t<0 
does not have any complicated properties—it represents the retarded fields 
of other particles in the universe. It is the future component B\ > T that 
has the interesting properties. So to simplify discussion we will take 

B\< o = 0 (110) 

and concentrate attention on B\ > T . 

We begin with some definitions. We have already defined the direct- 
particle potential due to particle b : 


A«\X) = fe b S(s 2 xb ) db t . 


(Ill) 


Separating 8(s 2 XB ) into advanced and retarded parts. 




1 


2 Lv 


[6(/ — t B -f- \x — jc b |) -f <5(/ — t 


B 


B 


x - x B \)], (112) 


where X i = (jc, t ), B 


i x B' { b)^ we define correspondingly 


A/ b) (X) 


2 


[A/ b >(X) ret + A^X)^]. 


(113) 





2 Am (X) 

b a 


l 


2 


M < (a)(^) ret + ^(a)Wadv]- 


(114) 
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Thus 





adv 


2 v) 

b a 

f S( f - 

- >B ~ X ~ 

X B 1) db i 


x - x B 

LIU , 

2 e bj 
b a 

f S(t - 

- l B + Y ~ 

- x R \) 

B ' si hi 

1 

X - x B 

6/ L/ • 


(115) 


In the classical case, with the omission of the self-action terms, (108) can 
be written in the form 


zll e a f B\> T^ a i y 2 e a f t^ l (a)ret + ^Voadvl ^ a i' (H6) 

a a 

We found in the classical case that, in a universe with consistent retarded 
solutions, the first term of (116) cancels the advanced component of the 
second term, doubles the retarded component, and produces the Dirac 
radiative reaction. Thus in the classical case the response of a completely 
absorbing universe may be expressed in the form 

> r = y 2 K (w 0) ret - Oadv]. (117) 

Z b 

When we substitute (117) into (116), we get 

- 2 e a f { A to) ret + J M^ret “ ^“ady]) da v ( 118 > 

a 

We now generalize (117) to quantum electrodynamics. To do so we refer 
back to the detailed nonrelativistic discussion in §4.3. To determine transition 
probabilities, we had to take the square of the modulus of certain path 
integrals. This brought in conjugate paths as well as paths. The appearance 
of both paths and conjugate paths was inevitable, since the local transitions 
came through the response of the universe, which in turn came through 
transitions in the absorber. In working out these transitions, one must make 
a probability, not an amplitude, calculation. Thus the response of the 

universe acts via both paths and conjugate paths. 

The same situation occurs here. We have to take the square of the modulus 
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of the path integral in (101). Corresponding to an integral over C a , we have 
an integral over r' a , the conjugate path. A typical expression in the expo¬ 
nential involving f a and f' a is i times 


in which 



A % (X )ret — A lia) (X ) ret + A\ a) (X) reV 
A\X )adv — A i(a) (X ) adv + A\ a) (X) adv . 


(119) 



The difference from the classical situation is, as pointed out earlier, the 
inclusion of the self terms, those involving A l{a) in (120). To define the 
quantum response, we need the separation of various expressions into 
positive and negative frequencies. We shall use the following notation: 


A\X) 


ret± 


2 



AHX) 


adv± 


2 



s ± (t 

t iB 

- v - *bI) 


2 

1 * - x b 


s ± (t ■ 

“ r B + 

X 


2 

X — . 

X B 



db\ 


db\ 


( 121 ) 


where 


8+(x) = 


1 


77 



00 


_ ^(x) 


i d 


o 


77 X 


( 122 ) 


(122) is used extensively in quantum electrodynamics. Its properties are 
discussed in Appendix 1 (p. 209). Thus we have 


A 


ret 


A i i Ai 

^ ret+ ' ™ ret—’ 


Ji — A i 
adv ^ * adv+ 


+ A 4 


adv— * 


(123) 


We now give the quantum-mechanical analogue of (117): 


BAX) 


i v n > T 


1 

2 


2 (Mi <w O) 


ret+ 




+ [AA»\X\ e t _ 


^' <W W adv J}. (124) 
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The expression (124) remains unaltered if we interchange paths and con¬ 
jugate paths and positive and negative frequencies. Thus 

> T ~ > y. (125) 


Also, when paths and conjugate paths coalesce, (124) reduces to (117). This 
happens in the classical limit when there is a unique path for each particle. 
Finally, (124) transforms relativistically as a 4-vector, although individual 
parts of (124) such as A i rei+ do not transform relativistically. It can be 
verified that, apart from the classical combination (123), the expressions in 
the square brackets in (124) are the only combinations which transform as 
a vector. 

If we now consider a pair of particles a and b , and use (124) as the response 
of the universe, the electromagnetic phase factor involving a and b in the 
double path integral (see Appendix 3, p. 217) can be expressed in the form: 


e*P[ ie a e b f fs + (s 2 A - B ) da 'i d ^ - 

f f SJs- /ir ) da\db 1 

*A' 

tB > f A' 

+ f f s +( s2 B'A) da ‘ db 'i - 

- f f &-(S 2 AH’) da i db ’i 


^A > t B' 

~ f f S + (s 2 AB )da‘ db i + 

f f 8_(s 2 a > b -) da\ db' 1 j. 


(126) 


This is the influence functional F[a , b\ a', b'] for the interaction of particles 
a and b. If we use (126) we can treat the system as a local one in the slab 

0 < t < T. 

The delta functions in (126) can be expressed as three-dimensional Fourier 
integrals. Thus F[a, b; a\ b'} can be written in the form 


F[a , b\ a \ b'] 



+ ik • (x B — x A )] da i db l 


+ f J exp[iK\t A ’ — t B | + ik • (x A , — x#)] da\ db' x 

- f f exp [iK(t A - t B >) + ik • (x B . - x A )] da v db " 

— J J exp [iK(t B — t A ,) -1- ik • ( x A , — x B )] da \ db 1 1 J . 


(127) 


The terms in (127) have a simple interpretation. For t A > t B , the positive 
terms in the curly bracket contribute to a downward transition of particle 
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b and an upward transition of particle a, and vice versa tor t B ^> t A . The 
negative terms contribute to downward transitions of both a and b. Since 
the coefficient in front of all terms is the same, all these processes occur 
with equal probability. 

The case a = b needs separate discussion, although the details are not 
seriously different. The influence functional is now given by (see Appendix 
3, p. 217) 


F[a y a'] — exp \ ie 2 a 


f J 8 + (s 2 a , a ) da[ da' ~ J J S -^ 2 aa') da 'i da ' 


t. 4' > t 


A 


t A > t 


A' 




(128) 


Corresponding to (127), we have 

F[a, a'] = exp f dQ J Kdk[ - J j exp [iK(t A - t A .) 

4 - ik • (x A , - x A )] da* da[ + f f exp[iK(t A - t A ) + ik • (x A - .v A )] da 1 da i 

t A > *JC 

+ f J exp [iK(t A . — t A ,) + ik • (.v 4 - — xj,)] da’ 1 ^a'i]). (129) 

> tA■ 

This is the same result as the one obtained earlier using the nonrelativistic 
theory. The first term in the curly bracket gives spontaneous transitions, 
whereas the second and third terms contribute radiative correction effects. 
Provided we add similar expressions for interactions involving closed loops, 
all of quantum electrodynamics is contained in (126) and (128). As explained 
in the introductory chapter, we regard this result as a critical requirement 
for any new point of view in physics. Having reached this position, we can 
now extend our work outside electrodynamics. The above discussion has 
been kept as brief as possible, giving only essentials; further details are given 
in Appendix 3. 


Radiative Corrections 

We end this chapter with remarks on radiative corrections in the direct- 
particle theory. Once again. Appendix 3 (p. 217) contains a more extensive 
discussion of these matters. 

The main point on which radiative corrections depend is the fine structure 
of paths. We have already seen that r ± paths are made up of chains of 
K 0 propagators. Now K 0 + ( 2; 1) is given by 
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As the separation between 1 and 2 decreases, the delta function in (130) 
dominates. Thus af + path is made up of small null segments—if we look 
at it sufficiently closely. If in the self-action integral 



(131) 


the points A and A lie on the same null segment, s 2 ^ = 0 and da t da 1 is 
proportional to s 2 ^. Since \ 8(X) = 0, we get no contribution provided A 
and A are distinct. If A , A, are on different segments of a r + path, s 2 ^ > 0 
and 8(.s 2 a j) = 0. Thus the only contribution to (131) from such a path comes 
when A, A, are made to coincide. So long as we prevent this possibility, 
we should be able to avoid divergences. The same situation holds for r~ 
paths. On the other hand, we expect contributions to (131) from interactions 
between r + paths and r~ paths. These interactions are needed in the theory, 
otherwise there would be difficulty in understanding the interaction of 
particles and antiparticles. 

In calculating the integrals involving self-action, we use the rule 




1,2, 3,4, 


(132) 


to keep A and A apart, where e { is a 4-vector associated with the particle. 
Explicitly, we take 



(133) 


where p l is the 4-momentum, m is the mass, and e is a scalar length small 
compared to m -1 . 

We now calculate the energy of a free particle in the usual way, starting 
from (129). However, to avoid losing sight of (132), we must not perform 
coordinate integrations until after we perform the momentum integrations 
(the reverse of the usual procedure). The calculation leads to an increase 

in energy due to self-action by 

AUL = _ If! i n(we) (134) 

m 2 tt 


If we identify e with the gravitational radius 2 Gm of the particle (the 
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smallest known length), we get 

M \n(2Gm2). (135) 

m Z77 

This is a finite correction, of the order Am/m ~~ The theory is thus capable 
of giving unambiguous finite answers, but the exact nature of e remains to 
be understood. 

The same e occurs when we deal with closed loops. We have already seen 
how closed loops arise in the theory. In calculating a physical process, we 
must also include interactions with closed-loop paths in our influence func¬ 
tionals. This inclusion leads to a modification of effective charge in the usual 
way. The calculation is also similar to the usual one, but we must again 
perform the momentum integrals first and the coordinate integrals later, 
and we must keep the two legs of the delta functions apart by the rule (132). 
This procedure leads to a charge modification by 

ln(me). (136) 

e 3 7T 


With e = 2Gra, Ae/e ~~ — f. 

We also note that the quadratic divergence of vacuum polarization does 
not appear at all. This divergence is due in the usual theory to the constant 
part of the electromagnetic potential. But, as we see from (107), if B i is 
constant 



and no effect is produced by such a potential. This illustrates the advantage 
of being able to write the theory in a closed form. 

To summarize, by quantizing the direct-particle theory, we have shown 
it to be as comprehensive as the Maxwell theory. That all scattering and 
decay processes are contained in the influence functionals discussed here 
shows the power of the path-integral technique. The phenomena of sponta¬ 
neous transitions and decays of states are seen to be related strongly to 
cosmology and thermodynamics and to throw light on the nature of the 
arrow of time. These aspects of the theory are an advantage over Maxwell’s 
theory. Finally, radiative corrections can be made finite by a rule that is 
intuitively easier to understand than the usual techniques of renormalization. 






5 

DIRECT-PARTICLE THEORIES 
IN RIEMANNIAN SPACE-TIME 


§5.1. THE FOKKER ACTION (LECTURE 12) 


We now return to the question, raised already in Chapter 3, of how to 
generalize the Fokker action to curved spacetime. In this chapter we shall 
also consider certain general aspects of direct-particle theories. 

We begin with the concept of parallel propagators. Suppose two points 
A and B in Riemannian space are connected by a unique geodesic r AB . 
Let X % * be a 4-vector at point A. Since we are concerned with nonlocal prob¬ 
lems, we use A as a subscript on the vector index /' of X to show that X iA trans¬ 
forms as a vector at A. Suppose we move X'* parallel to itself along r AB , so 
that when it arrives at B it transforms as a vector X 1 b at B. Since parallel propa¬ 
gation preserves linearity, we can write 


= g l “ u X l \ (1) 

where the functions g\ depend on A, B, and on the geometry of space- 

A 

time. The index i A can be raised with the metric tensor g lk at A and the 
index i B lowered with g ik at B. Thus we have the quantities 



g 
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determining parallel propagation of covariant and contravariant vectors. 
There is symmetry with respect to A and B , 


oi A i B 


oi B i A ^ 


• • 


g 


l A l B 


• • 


g 


l B l A 



The functions (2) are called parallel propagators. Since they transform as 
vectors at A and B, such quantities are called “bivectors” or “two-point” 
vectors. We can similarly have biscalars and bitensors, etc. 

Because the proper distance s AB along f AB , 


Sab = I Vg ik dx { dx k , (4) 

r AB 

is a biscalar, we can give a meaning to 8(s 2 AB ) in curved spacetime. Thus 
it might appear at first sight as if the Fokker action could be generalized 
by replacing the flat-space form 

8 ( s2 AB)v ik dai dbk > ( 5 ) 

by 

8 ( s2 A B )gi A k B daiAdbkB ‘ ( 6 ) 

Although this is certainly a possible procedure, it does not lead to the 
equations of electrodynamics. The required generalization comes from the 
curved-space version of the wave equation 

^a^ab) = 4^ 4 (zt, B). (7) 

Indeed, we may regard 8(s 2 AB ) in (5) as the Green’s function of the flat-space 
form of the scalar-wave equation 

□<f> = 0. (8) 

In curved space, we again look for the Green’s function of (8). We shall 
also be concerned with the Green’s function of the vector-wave equation 

UA i + R*A k = 0. (9) 

Letting be a variable point with respect to which derivatives are taken, 
we consider the scalar Green’s function G(X, B) and the vector Green’s func¬ 
tion G ixiB as the solutions of the equations 


□G(W, B) = [-g(X, B)]~ l/2 8 4 (X, B\ 


(10) 
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□G- • 4 - R • (j . 

— l v l - 1 I 


X C B 


k vl 


X l B 


[-g(X,B)) 


- 1/2 


^)> 


(li) 


where g(AT, B) is the determinant of the matrix g i • . Since 

iV /? 


= Sik(B), 


( 12 ) 


the delta functions 6 4 (A r , B) in (10), (11) permit g to be replaced by g. How¬ 
ever, the writing of g and g t i in (10) and (11) serves to remind us that 
we are dealing with two-point quantities. We further choose the solution 
of (10) or (11) to be symmetric with respect to X and B\ 


G(X,B) = G(B, X), 


Gi 


l X l B 


Gi 


l B l X 



These Green's functions have been studied by various authors for other 
reasons (e.g., DeWitt and Brehme, 1960). We shall discuss briefly those 
properties which concern us directly. 

First, we have a relation between G(X, B ) and G i 4 : 

X B 



G(X, B)., 


;v 



This can be proved without great difficulty. We next write the formal 
solutions of (10) and (11) obtained by Hadamard’s method of series expan¬ 
sion: 


where 


G(X,B) 


G • 

l X l B 



u(x,B)e(s 2 XB )}, 

— V 'x i B 



(15) 

(16) 

(17) 


The presence of the heaviside functions in (15) and (16) shows that if we 
have a delta-function source at B, its effect propagates not only along the 
null cone from B , but also inside it. This breakdown of Huygen’s principle 
has no analogue in flat space. In flat space v = 0, v l<ie = 0, A -J. If we 
go back to (6), we now see that it forms the leading part of 4v but 

not the whole of it. 
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We are now in a position to generalize the Fokker action to 



a 


a < b 


The potentials and fields are defined by 




In view of (11) we have 



( 20 ) 


where 



8 4 (X, B) db { B 


( 21 ) 


has the required vector property of the current. Because of (14) we have 




Here we have assumed the world line of b to be between points B 1 and 

B 2 . If B 1 —>-oo, B 2 -*~ + oo (where — oo means infinite past and + oo 

means infinite future), we get zero on the righthand side of (22) for all the 
usual cosmological spaces. In such spaces the gauge condition is satisfied 
unless there is creation of charge. Henceforth we shall restrict attention to 
these spaces and assume no net charge creation, so that 



(23) 


We have therefore generalized the Fokker action to Riemannian space- 
time and in a way that preserves conformal invariance. Consider a con¬ 
formal transformation 



(24) 


and define 



(25) 
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where <p (a) satisfies the scalar equation 

*~^(a) + 2?;i[Aa)* + ] = 0 (26) 

with respect to the matrix g ik . It is not hard to show that 

^*(a)i ;i - 0, (27) 

and hence that A* ^ satisfies the starred form of the wave equation (20). 
The derivative in (27) is calculated with respect to the metric g* ik , as are 
the derivatives in the wave equation for A* (a)i . Yet the (^-functions in (25) 
do not affect F* ik , which remains equal to F ik . Nor do the (^-functions affect 
the electromagnetic part of the action, 

2 J 6,1 / A * ia)i ^ = 2 J e a f A (a)i dai ■ (28) 

a a 

The physical content of the theory remains invariant with respect to (24). 


§5.2. GRAVITATIONAL INFLUENCE OF DIRECT-PARTICLE 

FIELDS (LECTURE 13) 

Do direct-particle fields exert any gravitational attraction, as ordinary fields 
do? We shall discuss this question for the explicit case of the electromagnetic 
field. The direct-particle formulation of electromagnetism has only the single 
action term 




<T { da 1 a db l s 

l A l B 


whereas the Maxwell action has two terms 




The form (29) differs from the action used in Chapter 3, not only in hav¬ 
ing 477 ( 7 ; ; in place of $(s 2 AB ), but also in having unrestricted summations 
with respect to a , b. Thus in (29) we can have a = b. From what was said 
in Chapter 4 concerning self-action, it might seem that in classical theory, 
with timelike paths for the particles, the self-terms in (29) would be zero 
provided points A, B , were never permitted to coincide. This was the case 
in flat space. However, it is possible for G i ijt to be nonzero in curved space 
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even though A, B , are distinct points on the same timelike path, because 
of the v i i function which appears in [see equation (16)]. Hence the 
omission or inclusion of the self-terms is a more acute question in curved 
spacetime than it is in flat space. Since the self-terms are needed in quan¬ 
tum electrodynamics, it is correct to include them also in (29), even though 
the discussion is classical. To avoid infinities, points A and B in such terms 
must never be allowed to coincide. 

The gravitational effect of a field or of an interaction is expressed by an 
energy-momentum tensor, T ik , say, defined by expressing the change of the 
action corresponding to a variation of g ik , 

gik gik + 


in the form 


T ik 8g ik d*x, (32) 

V being the four-dimensional volume in which 8g ik is nonzero. Thus to 
obtain the electromagnetic energy-momentum tensor T ik iem) we have to 
calculate the variation of (29) for the direct-particle theory, and of (30) for 
the Maxwell theory. The result in the latter case is well-known, and is 

T\ m, = ± (jg ik F lm F lm - F“F k ,V (33) 



in which the electromagnetic field tensor F lm is taken to be retarded. 

Because neither A x nor da 1 changes when g ik is varied, the second term 
of (30) does not contribute to T lk {em) in the Maxwell theory. Thus (33) is 
derived entirely from the first term of (30). Since (29) has no analogue to 
this first term, but only one to the second term of (30), it might seem at 
first sight as if the direct-particle theory would lead to an energy-momen¬ 
tum tensor that was identically zero. Yet it does not, because a change of 
geometry changes the solutions of the wave equations (10) and (11). Con¬ 
sequently i in (29) changes. To calculate this effect, define 

j4 








The electromagnetic field generated by particle b can be written as 


pib) 
i x 


4776*1 



/ y 


’/• 


fV ^ Y % 


db x B , 


(35) 
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and the wave equation (11) can be written in the following way: 


V^g(g il g mk G K . mk + R im G mi ) 


d 


dx k 


(g U g mk V^g&mU.) + g U G k i ;kl 


S 4 (X,B)g\ 


(36) 


For simplicity, the subscript X on indices taken at X has been suppressed 
in (36). 

The change (31) does not alter <5 4 (T, B)g i i , while 


8 


0 


_ dx 


k 


(g il g mk V 




0 


0.X 




(g u g mk vT? Sir mll ) 


+ -^k^mn B 8 (S il S mk V^g)], (37) 


8 (g t, v-gG k i B -ki) = 8 


g 


U V 


g 


3 

(G\; k ) 


dx 


S(g il V^g) 


3 '~k 


dx 1 




+ V -"x g il ' ~1 3 



1 


a 




g 


dx k 


(V^gG k iB ) 


+ V 


gg 


il 


0 


1 


0 


dx 1 l V — g dx k 


[8(V=g g km )G mie ] 


+ V 


gg 


il 


0 


dx 1 


1 


0 




g 


dx k 


( y—g g km 8 G mi ) 


, (38) 


so that the variation of (36) gives 


9 (gilgmk ^f~g S o f ) + yC T g gn 


dx k 


0 


dx 1 


1 


0 


V-g 


,k 


(V-gg k "' S G m i B ) 


d 


[S( g ii g mk V^gyx mliB ] - 3(V z ix i ')^ T (c\ ;fc ) 


dx 


V—gg 


il 


0 


dx 1 




0 


0.X k 


(V=gG\ B ) 


V 


gg 


il 


0 


1 


5 [6( V^i g km )G m J 1. 


dx 1 { V—g dx k 


mi B 


(39) 
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A 

FIGURE 5.1 

The requirement that SG { , be zero, with respect to 

Sik ~* Sik + when B lies appreciably outside the light 

cone from A, leads to the canonical electromagnetic tensor. 
If scattering backward from points within the light cone 
were permitted, then SGj t would be nonzero, and we 
would obtain the so-called Frenkel tensor. 


The differential operator acting on 8G mi on the lefthand side of (39) is the 
same as that acting on G mi in (36). We can therefore use appropriate so¬ 
lutions of the wave equation (11) to obtain G t { . Some care is needed, how- 

A B 

ever, in handling the retarded and advanced parts of the source term in 
(39). Restoring the subscript X, and writing 8Q\ for the righthand side 
of (39), we have 




i x (ret) 


+ Q 


X . 
l B 


(adv) 



where the retarded and advanced parts in (40) are obtained by putting 


Gi 


l X l B 


- fG • 

2 ll X l B 


(ret) 


+ G< { < adv) l 


X l B 





in Q\ ; G i i (ret) , G i i (adv) , being the retarded and advanced solutions of 

B X B X ' B 

(11). The issue requiring particular care is that the correct propagator for 


Q\ (ret) is G, . 

^ l B l A l X 




i 


B 


A m A M. 
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The reason for this separation ot the retarded and advanced parts of Q\ 
will be understood from Figure 5.1. Since 


G ■ (ret) = 

% A l X 



Gx 


(adv) 


1 a Tv 


— Gx 


(ret) 


l x l A 



we can write, equivalently. 



The two terms in (44) are treated similarly; so it is sufficient to consider 

one of them. As usual, we set the dg ik and their first derivatives at zero 
on the boundary of V. Defining 


C J 


mli 


(ret) 


dGn (ret) 3 G mi (re0 

^ B _ TWlg 


B 


dx 


m 


dx 


i 


oj- (adv) 




mli 


dG u ,adv) a <7 . <adv) 

l V B 1711/ Q 


dx 


m 


dx 



and using the divergence theorem, it is not hard to see that 


2 f Q. (ad y)Qi x (ret) ^4* 

X A B 


1 

2 


f 8(g u g mk V^g)» iU^ M $mli B M d 4 x 



9 


«( g il )G u f M ^7 [G ,<ret)fc is;ft .] d*x 


f «( g il ) [G^\. k ]G u ^ d 4 x 
f S(V^g)G^\ ;k G Ml iB . l d*x. (46) 


The index X has again been suppressed on the righthand side of (46) from 
indices taken at point X. 

To calculate the variation of (29), we simply integrate along the world 
lines of a and 6 , and take the double sum with respect to a , b. If there is 
no net charge creation we get, as explained before, 






Since the advanced and retarded parts of J G. is db 1 * vanish separately, only 
the first term of (46) survives. Moreover, after the sums with respect to a , 
b , this term becomes symmetric in the advanced and retarded fields. The 
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two terms of (44) are therefore seen to give equal contributions. Because 
of the factor 2 on the lefthand side, it follows that (46) gives the full con¬ 
tribution of (44). Using (35) in the form 


pretib) 


ik 


477^ 



(ret) ^x B ^7adv(&) 


ik 


477 6 \ 



(adv) (48) 


we have 

~ T 5 2 E 4,ne a e b I f Gi A i B da ' A db%B 

L a b 

= 2 2 f 8 (S il g mk V^g)F™ m ml F^‘\ k d*x. (49) 

a b V 

Remembering the definition (32) of the energy-momentum tensor, we find 





^ J7adv(a) 




pret(b)l 


m 


L b 


"Sp p&dv(a)il 
a 


pret(b)k^ 



The result (50) is similar to the energy-momentum tensor of the Maxwell 
theory, except that the advanced and retarded fields appear symmetrically 
in (50), whereas only retarded fields occur in the Maxwell theory. However, 
(50) can easily be reduced to the usual form when account is taken of the 
response of the universe. 

Consider the slab of spacetime 0 < t < T. For any point X in the slab 
we have 


zrret(a) — rrret(a) . /7ret(a) 

1 ~~ 1 0 < t < T ' r t 

rad via) Z7adv(a) i i7adv(a) 

r — 1 0 <t<T' r t 


< 0 ’ 


> T 


(51) 


for any particle a. As in Chapter 4, we consider the case F ret t 0 = 0, and 
express the response condition in the form 


pad v(a) 


t > T 


j^7ret(a)^ ^ ^ _ y^adv(a) 


0 < t < T 


]• 


a 


a 



Substituting (52) in (50) leads immediately to 
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a 





which is the same as the usual energy-momentum tensor. 


§5.3. A GENERAL CORRESPONDENCE BETWEEN 

FIELD THEORIES AND DIRECT-PARTICLE THEORIES 

The emergence of an energy tensor in the above calculation and its similarity 
to the energy tensor (33 ) of field theory is surprising at first. But this is no 
accident. Rather, there exists a general correspondence between the field 
concept and the action-at-a-distance concept, of which electromagnetism is 
a special case. We now state a general result obtained by Narlikar (1968) 
that covers a wide range of cases. 

Let <^> be a tensor field of rank N in interaction with particles and given 
by an action of the type 



(54) 


a 


The first term contains a field Lagrangian, the second describes the inter¬ 


action of <(> with particles. L[<^>] and /[<£, a] satisfy the following conditions. 
First, L[4>] is a bilinear invariant composed of c/> and of its first derivatives. 


The coefficients appearing in the invariant are functions of the spacetime 
geometry. Thus for a tensor field L[<f>] would be a combination of the 
form 



where A, B , C, are tensors connected with the spacetime geometry. 
Second, /[</>, a] is an invariant expression of the form 


/[<*>, a] = D<p • i >a) , 


(56) 


where D is a coupling constant and £ <a> is a tensor of rank N depending 
only on the world line of particle a . 
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To a field theory given by (54), there corresponds a direct-particle theory 
of the form 


jZl D2 ff t <a> • G ab • da db, (57) 

a b 

where G AB is the symmetric Green’s function of the field equation satisfied 
by <p. Thus G AB is a tensor of rank N at each end A, B. The direct field 
due to particle b is given by 

<jJ b) (X) — D J G xb • | (w db. (58) 

It can be shown that the variation of geometry expressed by (31) leads to 
similar energy-momentum tensors in the two theories. 

We end this chapter by giving the field theory corresponding to the inertia 
theory described later. The mass field m(X) has the action given by 


m.m 1 - Rm 2 

6 

where m i = dm/dx\ It is easily verified that (59) is conformally invariant. 
A theory of this type has recently been used by particle physicists to intro¬ 
duce scale invariance, the massless particles given by 

□m + — Rm = 0 (60) 

6 




being called “dilatons.” 





INERTIA AND GRAVITATION 

TREATED CLASSICALLY 


§6.1. UNITS IN PHYSICS (LECTURE 14) 

Let us return for a while to the flat space of special relativity. A general 
point X in spacetime is described by four coordinates /, x, with the 3-vector 
x = (x,y, z) carrying the spatial coordinates x,y,z. The origin is the point 
t — 0, x = 0, and the square of the four-dimensional distance from the origin 
to X is given by 


t 2 — x 2 . (1) 

Different observers in uniform relative motion with respect to each other 
can all choose the same spacetime point as origin, but then they find that 
the coordinate values which they attach to the general point X are not usually 
the same. However, (1) calculated for any X by each observer is the same. 

Sometimes the velocity of light c is included in (1), c 2 t 2 — x 2 . This is 
necessary if we elect to use a unit for measuring time that is different from 
the unit used for measuring spatial intervals. Although such a procedure 
may be convenient in some practical applications, there is no more logical 
reason for using a different time unit than there would be for measuring 
x, y, z, in different units. Apart from the sign difference in (1), all four 
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coordinates appear on the same footing, and should be described in terms 
of the same unit. Then c — 1, all particle velocities have magnitudes between 
0 and 1, and these magnitudes are dimensionless. 

In the MKS and SI unit systems, it is recommended that x, y 9 z, be 
measured in meters and t in seconds. With the definitions used for the meter 
and the second, one then has 

c = 299,792,500 meters second -1 . (2) 

This is said to be the “velocity of light,” but the velocity of light is unity; 
so (2) only describes the relation between “meter” and “second,” 

1 second = 299,792,500 meters. (3) 

The number 299,792,500 is manmade. 

In the cgs unit system, the relation between the spatial derivatives of the 
magnetic vector H and the time derivatives of the electric vector E is 

cud // = 4 + (4 ) 

c c ot 

The scale factor c — 3 * 10 10 cm sec -1 (telling us that one second is the same 
as 3 • 10 1() centimeters) belongs with t in the second term on the righthand 
side, dE/d(ct), correctly reminding us that the same unit must be used for 
taking both the time and space derivatives. And the current vector j is 
determined by the motion of charged particles. In the cgs system, the velocity 
vectors of the particles have to be scaled by c _1 . This is why c _1 also appears 
in the first term on the righthand side of (4). 

The cgs system is logically defective in using different units for t and x, 
but at least it always reminds us where scale factor corrections must be 
introduced. This reminding permits calculations to be made without serious 
risk of error, and without distorting the meaning of the physical equations 
themselves. In the MKS and SI systems, on the other hand, the scale factors 
c~ l in (4) are absorbed into j and E. Hence not only are t and x scaled 
differently, but there is also a further difference between E and //, which 
should properly fit together to form the six components of the antisymmetric 
field tensor F ik and therefore should not be differently scaled. These latter 
systems therefore lead to confused thinking. 

Further, there is no advantage to be gained from attempting to suppress 
the 477 in (4). The unit sphere in Euclidean 3-space has area 4 tt. However 
intensely the protagonists of modern unit systems may regret this fact, it 
happens to be true, and it causes 477 to appear in various physical equations. 
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If one suppresses the 477 factor in one place, it appears inevitably somewhere 
else. 

The above discussion contains a still undefined concept, that an observer 
is able to measure coordinate dilierences t 2 — t v x 2 — jc j , between points 
X v X 2 , in terms of some length unit L. We must consider next how this 
is to be done. Spacetime itself provides no answer. Spacetime is not divided 
up like a four-dimensional piece of graph paper into equally spaced intervals 
L that serve as a grid. We must appeal to physics, not to geometry. 

A plane electromagnetic wave of frequency A varies with position X in 
a region free from particles according to the phase factor 

exp[/(A' • x — A/)], k=\k\, (5) 

the 3-vector k giving the direction of propagation of the wave. Such a 
physical disturbance can be used to establish a spacetime grid. For a specific 
spatial coordinate, say, x, choose the wave with k in the direction of the 
x-axis. Then (5) takes the form exp[/A(x — /)]. The grid can be imagined 
to be set up in the following way. On any hyperplane t = constant, the phase 
factor has periodicity 277A -1 with respect to x, and on any plane x = constant 
the phase factor has the same periodicity with respect to the time-coordinate. 
Similar periodicities can also be set up for the y, z, coordinates. A standard 
grid can thus be obtained using the A -1 value appropriate to the radiation 
emitted by an explicit atomic transition. The grid has the desirable property 
that it gives the same unit for the t coordinate as for the spatial coordinates. 
However, this important property is unfortunately obscured in practice 
because different atomic transitions are used for time and for space. For 
the space coordinates, the 2 p lQ to 5 d b transition of 86 Kr is used, whereas 
for the time-coordinate the transition used is that between the two hyperfine 
levels of the ground state of 133 Cs. 

The merit of using an explicit transition to determine a specific A -1 value 
lies in the fact that, if we choose a suitably long-lived excited atomic level, 
the A -1 value can be well-defined. Such a choice means, however, that our 
basic length unit will be determined by a somewhat obscure transition of 
a complicated atom. The A -1 value in question then depends on the electron 
mass, the fine-structure constant, and the nuclear properties of the chosen 
atom. A precise, direct calculation of such a A -1 value from the basic 
principles of atomic structure is beyond present-day methods, and to this 
extent the procedure remains intellectually unsatisfactory. One might hope 
to define a length unit L in a simpler and more elegant way. It may still 
happen, in those macroscopic problems where the greatest accuracy is 
needed, that it is preferable to work empirically in terms of the transitions 
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of complex atoms, 8<i Kr or 133 Cs, for example, but for most purposes, and 
especially for obtaining insight into the nature of physical dimensions and 
units, a simpler determination of L is still needed. 

From here on we write 


x 9 y 9 z 9 t = jc 1 , x 2 , x 3 , x 4 ( 6 ) 

whenever we wish to use four-dimensional notation. Thus 

x t x l - rfik xi x k , i, k— 1,2, 3, 4, (7) 

where r\ ik — diag( —1, —1, —1, +1) is the Minkowski tensor. 

Consider a nonrelativistic classical particle that goes from point X 1 to 
point X 2 . The path by which it does so can be determined from the action 





where £ is the Lagrangian and the integral is taken along a specific path. 
Suppose the path used in (8) to be subject to a small variation. Let S' + 8S 
be the new value of the action. If SS is zero to the first order in small 
quantities for every such variation, then the path from which we started 
happens to be the classical path. To give expression to this procedure, one 
uses the usual methods of the calculus of variations, the resulting differential 
equations being the “equations of motion” of the particle. 

For a nonrelativistic free particle of mass m, the Lagrangian is just the 
kinetic energy \ mx 2 . For a nonrelativistic particle in an electromagnetic field 
with 4-potential A it we have 



/= 1,2, 3, 4, 



where e is the charge of the particle, and dx* is a four-dimensional coordinate 
displacement along the path. 

It is easy to pass to the relativistic case. Inserting (9) in (8), 



and the electromagnetic term is immediately invariant. Let da be an element 
of four-dimensional length along the path, so that 


da 2 = dt 2 — (dx ) 2 , 


da > 0. 


( 11 ) 
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Thus 


da — dt[ 1 — i 2 ] 1/2 — dt 


1 


1 


2 


• o 

jc~ + 


Replacing (10) by 




i* is seen that the term in jc 2 in the expansion of da yields the jc 2 term in 
(10). There is also a term 



but this does not affect the calculation of the path, since it involves only 
m and the time-coordinates of X v X 2 , which quantities do not change when 
the path is varied. The term of order (jc 2 ) 2 in (13) is neglected in the 
nonrelativistic approximation. The required relativistic form of S is given 
by (13). 

When the equations of motion are determined by requiring 8S = 0 with 
respect to small variations of the path, the electromagnetic term in (13) is 
found to have the effect of subjecting the particle to a force, usually referred 
to as the Lorentz force. It is given by 

e f ik -p, i, k — 1,2,3,4, (15) 

da 


where F ik is defined by 



The above formalism can readily be adapted to nonrelativistic quantum 
mechanics. Instead of seeking a unique path from X x to X 2 , consider all 
paths without time-reversals, and evaluate 


2 ex P 

paths 




Provided a suitable measure of paths is used in this summation, the result 
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has the physical interpretation that the square of the modulus of (17) gives 
the probability that the particle, initially at X v goes to X 2 - 

The phase angle S/h in (17) must be dimensionless. Indeed, the constant 
ft has been introduced to ensure that this is so. Here ft — h/lir, where ft 
is the old form of Planck’s constant. Instead of inserting ft explicitly in (17), 
we could, however, have redefined the Lagrangian to be 



Instead of (17) we would then have 


^ exp (iS). (19) 

paths 


That is, the action S given by (8) with £ containing the tr l factor would 
be dimensionless, and the Lagrangian itself would be an inverse length, L~ 1 . 

The dimensionality of the electromagnetic 4-potential A i can readily be 
determined by considering the classical field. Denoting the particle by b , 
the contravariant form A 1 of the 4-potential due to b at a general point X 
is given by 


A\X) = ej S(s 2 xb ) db\ (20) 

where db l is a coordinate displacement along the path of the particle, B 
is a point at db\ s 2 XB is the square of the four-dimensional distance between 
B and X, and 5(^“ Y5 ) is the Dirac function with the property 

oo 

f 5(^ 2 ) ds 2 = 1. (21) 

— 00 

The covariant and contravariant forms of the electromagnetic potential are 
connected by 


A i = V ik A k . (22) 

From (21) it follows that 8{s 2 ) has dimensionality L~ 2 \ so the righthand 
side of (20) has dimensionality eL~ l . Since A x has the same dimensionality 
as A\ it follows that the dimensionality of the electromagnetic term in (18) 
is (e 2 /^)L _1 , whence we see that e 2 /ti must be dimensionless. With e the 
elementary charge, e 2 /h is the fine-structure constant, with the experi- 
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mentally determined value 7.297351 x 10 3 . Since e 2 appears in association 
with ft in the form (18) for £, and since the properties of physical systems 
are determined from £, it follows that the elementary charge will always 
appear in physics in association with ft, as e 2 /ft . Hence there is nothing to 
stop ft being absorbed into e 2 , in which case the elementary charge is 
dimensionless, and satisfies 


c 2 = 7.297351 x 10' 3 . 



The form (18) of the Lagrangian for a nonrelativistic particle is now 



with the understanding that A i is to be determined with the elementary 
charge satisfying (23). Can ft also be absorbed into ml Since m will never 
appear except in association with ft, there is again no reason why we should 
not do so. This is for nonrelativistic quantum mechanics. The same situation 
occurs also in the relativistic theory. Writing y k for the Dirac matrices, 
k — 1,2, 3, 4, the Dirac equation for a free particle of mass m is 



and the mass occurs only in the form m/ft. 

After absorbing ft into m , it is clear that m must have dimensionality L -1 , 
since £ given by (24) itself has dimensionality L~ x . A length unit can 
therefore be defined in terms of the mass of some chosen particle. Choosing 

the electron. 


L — m e 1 = C, (26) 

where C is the Compton wavelength of the electron. This choice of the 
electron as standard is useful in many problems, because e~ 2 C determines 
the order of the particle separation in macroscopic bodies, and (<? 4 /4 t 7)C -1 
is the Rydberg constant. Thus C and C~\ together with appropriate powers 
of e 2 , determine both the characteristic emitted frequencies from atomic 
transitions and the structure of macroscopic bodies. 

Omitting gravitation, which involves a departure from flat space, physics 
is concerned with a set of particle masses and with thiee dimensionless 
coupling constants—in addition to e 2 for the electromagnetic field, there 
are dimensionless constants for the weak and strong interactions. From these 
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ingredients, together with the field propagators [the function in (20) 

was the electromagnetic propagator], all of physics (excepting gravitation) 
must be constructed. The field propagators have dimensionalities [cS(.v 2 Yj s) 
had dimensionality L ~ 2 ], but they do not determine a length scale. The only 
length scale available is that given by the mass of some reference particle, 
the masses of all other particles then being determined by dimensionless 
ratios to the mass of the reference particle. It follows that the procedure 
suggested here, of determining the length scale from the electron mass, is 
not just one of many possible procedures. Except insofar as some other 
particle might be chosen as standard, it is the only possible procedure. 

All physical quantities now have the form 

(dimensionless number) • C n , (27) 

the power n of C determining the dimensionality, and the dimensionless 
number giving the magnitude of the quantity. All physical measurements 
are concerned with determining such magnitudes. No laboratory measure¬ 
ment tells us about C itself. According to many standard texts, 

27 iC — 2.4263096 x 10 -12 meters, (28) 

it being implied that (28) gives information about the electron. It doesn’t! 
What (28) gives information about is the “meter,” namely, 

1 meter = 277/(2 • 4263096 X 10~ 12 ) • C. (29) 

§6.2. CONFORMAL INVARIANCE 

Suppose an observer at point X receives radiation emitted by atoms at point 
X' that possess a characteristic pattern of spectrum lines (e.g., the Balmer 
series of hydrogen). Let A be the measured wavelength of a particular line, 
and let A 0 be the wavelength of the same line emitted by atoms at rest in 

wr 

the observer’s laboratory at X. It is always found that 

1 + z = A ( 30 ) 

is the same for all such lines. How are we to understand the possibility that 
z might not be zero? 

One explanation for z ^ 0 is that the material at X' is in motion with 
respect to the rest frame of the observer at X. We then have the well-known 
Doppler effect determined in the following way. 
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The index k • x — 7] lm k l x m appearing in a plane electromagnetic wave is 
dimensionless and invariant with respect to a Lorentz transformation. Sup¬ 
pose the velocity of X' to be described by the spatial 3-vector (v, 0,0) with 
respect to axes .x, y, z , at X. The Lorentz relation between coordinates (jt, /) 
with respect to the observer at X and coordinates (x\ t') with respect to the 
observer at X\ the two observers being supposed coincident at t = 0, is 


x _ + vt' 

(\- V 2)V2> 

In order that 



V + vx' 

(1 - v 2 ) 1/2 ’ 



k- x — k x x + k y y + k z z - k t t = k' • x\ 

we require the contravariant 4-vectors (k x , k yi k z , k t ), (k x \ k y \ k z \ k t % to be 
related by 



k x ' + vk t ' 

(1 — v 2 ) 1/2 



Writing 




kt + v k x ' 

(1 — u 2 ) 1/2 



cos a = k x /k t , cos a' = k x '/k t \ 



we can express the first and last of the relations (32) in the form 


k 


X 


k t cos a 


k t '(cosa' 4- v) 


(1 - v 2 ) 


2\l/2 


k t — k 


(1 + v cos a ') 


t 


t 


(1 - u 2 ) 


2\l/2 


(34) 


The second of (34) is the usual Doppler relation, and the result 


cos a 


cos a' + v 
1 + v cos a' 



obtained by combining both of equations (34) is the usual aberration 
formula. 

To proceed, suppose that in some way we could be assured that the 
material at X' is not in motion with respect to X. How then should we 
understand z ^ 0? An unconventional possibility would be to take the basic 
length unit C to be different at the two points in question. This would imply 
that the electron mass was a function of position, m e (X). Although this 
possibility may seem strange, local laboratory experiments cannot serve to 
demonstrate its incorrectness, since experiments tell us nothing about the 
local value of C itself. 
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More conventionally, we could argue that the assumption of flat spacetime 
was in error. In place of 

ds 2 = Vik d x% dx k (36) 

for the square of the distance between points x % and x l + dx\ we could 
write the Riemannian form 


ds 2 = g ik dx l dx k , 



where the functions g ik (x), i, k — 1,2, 3, 4, are such that no transformation 
of coordinates will reduce (37) to (36) at all X. A transformation of coordi¬ 
nates can certainly be found to reduce (37) to (36) in the neighborhood of 
a particular X. However, the transformation required to make a similar 
reduction in the neighborhood of some other point, X 0 , say, will in general 
be different. A plane electromagnetic wave with phase factor exp[ — iri lm k l x m ] 
in the locally flat region at X does not reach X 0 with the same phase factor, 
even when we transform to locally flat space at X 0 . The form of the wave 
is modified by the non-Euclidean nature of the geometry in the region 
between X and X 0 . 

Apart from the Doppler shift discussed above, we therefore have two ways 
of understanding how z in (30) might be nonzero. The possibility of C being 
variable appears heterodox, but we shall now investigate a close connnection 
between this apparently curious possibility and the seemingly more orthodox 
appeal to curved spacetime. This connection arises because, as we have 
already seen in Chapter 2, the electromagnetic theory is invariant with 
respect to conformal transformations (not the same as coordinate trans¬ 
formations), i.e., transformations which change the geometry from (37) to 



where 



The meaning of this invariance is that, if F ik and j m satisfy Maxwell’s 
equations in the unstarred geometry, then 

F* k = F ik , J* m = (40) 

satisfy Maxwell’s equations in the starred geometry. The second of equations 
(40) is just that which we expect for charged particles having the same 
coordinate paths in both geometries. Thus, 
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particles in box of 
unit proper 3- 
dimensional volume 
at* 







similar box but 
in starred 
geometry 



Since 



and since unit proper three-dimensional volume at X contains * Q~ 3 times 
as many particles in the starred geometry, (41) leads immediately ioJF^X) = 

a~ 4 JHX). 

In virtue ol F ik * = F ik it follows that, if a given set of coordinate paths 
for the charged particles produces an electromagnetic wave with phase factor 


ex P [-hi m k l x m ] (43) 

at X for the unstarred geometry, then the same set of paths in the starred 
geometry will also produce an electromagnetic wave at X having (43) as 
its phase factor. However, the metric in the vicinity of X is now scaled by 
S2 2 (X); so the proper frequency of the wave is changed by i?( X)~\ This 
is just what we might have anticipated from the dimensionality C -1 of 
“frequency.’' Because of the conformal invariance of Maxwell’s equations, 
proper frequency behaves with respect to conformal transformations like 
an inverse geometric length. 

Suppose now that an observer at a point, say, X 0 , is able to ascribe 
coordinates x l to matter at all points X from which light is received. Can 
the observer hope to determine the geometry of the world, g ik (X), and also 
to discover if the physical length unit C is the same at all places? Suppose 
the observer imagines himself to be successful in this enterprise. Make a 
conformal transformation to g ik *(X) — f2 2 (X)g ik (X), with £2(X) ^ 42(A 0 ) for 
some point X. Then, provided we multiply the physical length unit at X 
by ti(X) and that at X 0 by &(X 0 ), there will be no change in the results 
of local experiments at X 0 or at X. In particular, there will be no change 
in the relation of the frequencies of electromagnetic waves to the physical 
unit. A spectrum line emitted at X by a specific atom keeps the same relation 
to the physical unit at X, and it reaches X 0 with the same relation to the 
physical unit at X 0 . Hence the conformal transformation, taken together with 
a scaling of the physical length unit at each point X by S2(X), preserves 
all the previous relationships. Evidently then, provided we admit the possi¬ 
bility of a physical unit C(X) varying with X, the starred geometry represents 
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the world just as well as the unstarred geometry. The answer to our question 
is, therefore, no. 

The situation would be different if we knew a priori that C must be the 
same at all X. This is the usual assumption of physics and cosmology. But 
we have no such a priori knowledge, nor can we expect that physical 
experiments will ever lead to a theory which would require such a constancy. 
Physical experiments, one would hope, will eventually lead to an under¬ 
standing of the mass ratios of elementary particles, but we cannot expect 
to learn about C merely from the determination of dimensionless quantities, 
which is all that our experiments permit us to do. It seems, then, that it 
is inherently impossible from experiments and observations ever to deter¬ 
mine a unique geometry for the world. 

On the other hand, it may be possible to expose the usual assumptions 
of cosmology to test. According to conventional ideas, the tensor g ik can 
be reduced to the Minkowski form r\ ik over a region of spacetime with a 
scale of the order of the diameter of clusters of galaxies. It is also considered 
that C is the same at all X within such a region. Then the wavelengths of 
spectrum lines emitted by all objects within such a region should be the 
same, except for variations due to Doppler motions of the objects in question. 
This condition is in principle open to test. At present, the astronomical 
evidence is equivocal; the case of the galaxy NGC 7603 and its companion, 
reported by Arp (1971), is of special interest here. 


§6.3. CONFORMAL INVARIANCE CONTINUED 

(LECTURE 15) 

The dilemma we have now reached has occurred before. Newtonian theory 
sought to determine the physical laws with respect to a unique direction 
of the time-axis—it was considered that the laws should be invariant for 
spatial rotations, but not for general coordinate transformations. This enter¬ 
prise proved to be impossible, and the impossibility became translated into 
a virtue, with the eventual understanding that physical laws must be invar¬ 
iant under general coordinate transformations. We can proceed in the same 
way here, by requiring the physical laws to be also invariant under conformal 
transformations ol the geometry. All such geometries then become physically 
equivalent. If for convenience we choose a particular geometry (in order 
to investigate some problem), our choice has no more absolute significance 
than has the choice of a particular coordinate system. 

Since particle dynamics as now formulated, whether classically or in terms 
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of quantum mechanics, is not invariant under conformal transformations, 
our present enterprise must involve a new approach to physics. The lack 
of conformal invariance shows classically in the form (13) for the action 
S. Since da changes to da * = f2 da under a conformal transformation from 
g ik t0 gik* — & 2 gik> ^ is clear that S' is not invariant under such a trans¬ 
formation unless m is changed to m* = £2~ A m. This change contradicts the 
usual idea that particle masses are fixed and inviolable. 

The same situation occurs in quantum mechanics. The Dirac equation 
in curved space is conformally invariant for 

m* = Or'rn, xp* = S2~ 3/ % (44) 

but not for fixed m. It will be noticed that the second relation in (44) requires 
that the probability of finding the particle in question in a three-dimensional 
box of unit proper volume must change by S2~ 3 in going from the unstarred 
to the starred geometry. This is just what we would expect from the changed 
relation between coordinate volume and proper volume. Once again the 
meaning of (44) is that if xp, m , satisfy Dirac’s equation in the unstarred 
geometry, then xp*, m*, satisfy the Dirac equation in the starred geometry. 
This statement is proved in Appendix 4 (p. 254). 


§6.4. INERTIA 

The transformation m* = ti~ l m demands a physical interpretation of mass. 
To obtain such an interpretation, we now take a Machian view, according 
to which the mass of a particle arises from interactions with other particles, 
and possibly also from self-interaction. Although we use the term 
“Machian,” our position now is much stronger than the rather vague philo¬ 
sophical arguments originally given by Mach. 

Let each particle b give rise to what we might term a “mass field.” Denote 
this field at a general point X by m {b) (X). At any point A on the path of 
particle a , we have m (b \A) as the contribution of particle b to the mass of 
particle a at the point A. Summing for all b gives 

m(A) = ^ m (b \A). (45) 

b 

The non-electromagnetic part of the action, S mai , f° r a set of particles a , 
b, ..., can be written in the form 

"HI* = - T 2 2 { »"K>' 

Z a a 


(46) 


6 INERTIA AND GRAVITATION TREATED CLASSICALLY 


1 10 


In order that (46) be symmetric for any particle pair a , b , we must have 
m {h) (A) of the form 



(47) 


where P(A, B) is a biscalar symmetric with respect to A, B. Then we can 
write (46) in the symmetric form 



(48) 


a b 


from which it is clear that S' at is conformally invariant provided 


(49) 


P*(A, B) = ti{A)- l tt{B)~ l P(A, B). 


The factor \ has been introduced into (46) in analogy to (29) of Chapter 
5, because the pair a , 6, appears twice in the double sum. 

Also, in analogy to the electromagnetic case, we permit a — b in (46). 
It was found necessary in Chapter 4 to prevent A = B in such a case, in 


order to avoid infinities. We could adopt the same point of view here, but 


doing so may not be necessary. The theory here will include gravitation, 
which was omitted in Chapter 4, except that a cutoff, defined gravitationally, 
was used there in order to limit the approach of B to A. However, the theory 
here will not be fully quantum; so this issue remains uncertain. 

The function P(X, B ) propagates the mass field of particle 6, and we expect 
it to satisfy a wave equation of the usual kind. It turns out that for (49) 
to hold, P(X, B) must be of the form (constant)* G(X, B), G(X, B) being 
a function with the properties 


n x G{X, B) + J R(X)G(X , B) = [-g(B )}~ 1/2 B\ (50) 


G(X,B) = G(B,X\ 


(51) 


where <5 4 (A, B) is the four-dimensional delta function representing a source 
of unit strength at B , and /?(2f) is the Riemannian scalar curvature at X. 
In order that masses shall satisfy the usual convention of being positive, 
we take the constant of proportionality between P{X, B) and G(X< B) to 
be positive, 


P(X, B) = X 2 G(X y B), 


X real. 


(52) 
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In flat space R(X ) = 0, and the solution for G(X , B) is just 

G(X, B) = S(s 2 xb ). 



We now have 



From (49), (51), (52), we require 



0 *(X, B) = G*(B , X) = B). (55) 

The meaning of (55) is that if G(X, B) = G(B, X) satisfies (50), then G*(X, B) 
given by (55) satisfies the corresponding wave equation in the starred 
geometry; we end this section by proving this result. 

First, we recall (77) of Chapter 2, 


R* = tt~ 2 [R + 6£" 1 Qf2], 



so that 


2 

6 


R*(X)G*(X,B) 


l 

6 


S 2 -\B)Q-%X)G(X, B)[R(X) + 6 ( 2 ~\X)n x ( 2 (X)l (57) 


Using 


we have 


g 


*ik 


a~ 2 g ik , 


v 


£ 


J2 4 \/ 


g 


(58) 


5) 


1 


a 


V-S*(20 0JC* L 

tt-\B)tt~\x) a 


V-g*(X) g* ik (X ) 


aG*(W 5) 


a.x 


& 


V-gW 9*’ 




3 (G(X,B) 


dx 


k 


V 





f?- 1 (5)^" 4 (A')[J2(A r )D v C(A', £) - <7(A, B)D A f?(A')]. 


(59) 


Adding (57) and (59), the second terms on the righthand sides cancel, giving 
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U’G*(X, B) + f R*(X)G*(X, B) 

6 


S2- 1 (B)S2~ 3 (X) 


U x G(X, B) + ^ R(X)G(X, B) 


C2- l (B)i2~ 3 (X) 

V^giB) 


SAX , B) 


SAX, B) 

V-g*(B) 


(60) 


This completes the proof. 


§6.5. GRAVITATION (LECTURE 16) 

In the Einstein theory one starts with an action 

S = l^Gf R ^ d4x ~Z 

a 

in which the mass m a of particle a belongs autonomously to the particle, 
and is a fixed quantity. The constant G is also introduced ad hoc, and is 
taken to be positive in order that gravity will turn out to be attractive. A 
variation 


formula 

m a f da (61) 


Sik Sik + $gik (^ 2 ) 

is then considered within a finite 4-volume V, with 8g ik satisfying the usual 
conditions of continuity and differentiability. We have 

5S = iLcl ^ R ^S)d*x - 2 m a sfda 

v a 

= ~^q f S(g ik R ik V^g) d'x - V m a S f da. (63) 

^ a 

As in (32) of Chapter 5, the energy-momentum tensor T (mat) i/c of the particles 
is defined by 

- 2 m a 5 / da = ~jf T (maO ,k Sg ik V^g d*X 

a 

= jf T (matHk Sg ik V^g d'x, 


( 64 ) 
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the last step of (64) following from 8g ik = -g ip g kq Sg™. It can be shown 
without undue difficulty that 


= 2 
a 


■ 1Z 


<5 4 (A r , A) da 1 da k 


i ——--- da 

[~g(A )] 1/2 da da 


(65) 


Noting that 8 V —g 


iga s g ik V^g, we get 


ss ~ i^g .( (• Rik ~ i R)sg ik v^gd 4 x 

+ ~2 J ^(mat>i/c ^ g Xk V — ~g d 4 X -f J g lk SRik V — ~g d 4 X. (66) 

A straightforward calculation then shows that the last term on the righthand 
side of ( 66 ) is zero, so that 

SS = f v ( R ik - j gik R + T mat )»fc) 8g ik \T? d 4 x. (67) 
Setting 


8S = 0 ( 68 ) 

for 8g lk satisfying the conditions of continuity and differentiability but 
otherwise arbitrary, and for V arbitrary, then requires 

^k ~ + 877 G T imSLtHk = 0, (69) 

which are the Einstein equations. 

In the direct-particle theory, in place of (61) we have the simpler expres¬ 
sion 


s = - y 2 / w O) del = ~Y 2 2 // G ( A > B ) da db - ( 70 ) 

a a b 

Subject to ( 68 ), can this be made to lead to the Einstein equations? Since 
there is no ad hoc constant G in (70), there cannot be a precise equivalence 
to the Einstein theory. The coupling constant A 2 —which is so far arbitrary— 
does not take the place of G, since A 2 will simply cancel from ( 68 ). It follows 
that, if we can obtain equations of the form of (69) from (70), the quantity 
G must turn out to be calculable. It can be seen from (61) that G must have 
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dimensionality L 2 . It is therefore not on the same footing as the dimen¬ 
sionless fine-structure constant e 2 , and must change as i? 2 in a conformal 
transformation—if indeed our physics is to be conformally invariant. 

Although (70) is simpler in its statement than (61), the calculation of 8S 
for (70) is more complicated. The equation satisfied by G{X , B) is 


0 


dx 1 * 


V-g(X)g i * k x 


dG(X, B) 

dx k * 


1 


+ -R(X)V^g(X)G(X,B) 

o 


= 8 4 (X,B). (71) 


The variation of (71) can be written 


0 


dx 


V-ZX ik -^ir8G 

55 dx k 


1 


+ -rRV^gSG 


d 


dx' 1 


S (v-gg ik ) 


dG 

dx k 


]_ 

6 


S(RV^g)G. (72) 


Just as in §5.2, we deal with the source term on the righthand side of (72) 
by splitting G into retarded and advanced parts, 


G(X , B) = j [G ,ret, (A\ B) + G (adv) (.Y, £)]. 


(73) 


The retarded part gives the following contribution to 8G(A, B): 


1 

2 


f G (ret> (A, X) -X «(VZig“) 

•^T 7 OX 


dG ire0 (X, B) 


dx 


d 4 x 


1 


12 


x 


From the divergence theorem, and noting that 

G (ret) (A, X) = G iady) (X, A\ 
(74) can be written more symmetrically as 


(75) 


1 

2 


0 

f S( g ik ) — [G <adv) (A r , .4)] ~ [G <ret) (A r , B )] d*x 

**v OX ox 


0 


1 


-jjf S(R ^-g> GiaM (X, A)G (ret >(X, B) d 4 x. 


(76) 
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The advanced part of (73) gives a similar contribution to 8G(A, B), but with 
the advanced and retarded forms of G in (76) inverted: 


j (3 '?) 

y f 5 ( V^g [G <rM (X, A)] —- [G (adv) (T, B)\ d*x 

•A OX OX 

8(R V-~g)G (ret \X, A)G (s,M (X, B) d\x. (77) 

Then 8G(A , B) is the sum of (76) and (77). 

The next step is to obtain 



*'2 2 // 8G(A,B)dadb. (78) 

a b 

Defining a “mass field” 

M(X) = ~ [M {Tel \X) + A/ (adv) (7)], (79) 

where 



M <re0 (X) = A X / c<ret, (A ', A ) da > 

a 

(80) 


M (adv> (7) = A Yj f G ,adv> (X, A) da, 

a 

(81) 

(78) simplifies to 


/J 

8( \G—g g ik )MX ei) M k {a - Av) - 7- 8(R ^/^T)A7 <^et, ^/ (adv, 

6 

(82) 

where 

». ret . _ 3M (re0 w (adv» _ 3AT adv) 

1 dx l ’ * ax' 1 ' 

(83) 

Remembering the factor A 2 in (52), the mass m(A) of particle a at point 


A is \M(A ), and 
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8S 


1 



2 


1 


2 


1 


2 

1 

r 

“ 2 

Jy 


1 


2 


1 


6 


8 


2 / m(A) da 


a 


2/ m(A ) 8 ( da ) 


a 


1 

2 


^ f Sm(/1) 




a 



T (mJ k S gik V 


gd*x f SG(A, B) da db 


a b 




g d 4 x 



v L 


5( V^g g i ' £ )A/ i <^et) A4 <adv, 


- S(R V^g) M <ret) M (adv) 




(84) 


The energy momentum tensor 




2 f m(A ) 


8 AX, A) da 1 da k 

-ttt, —;-— da 

[ — g(A)] lu da da 



is obtained in the same way as in the Einstein theory. The mass m(A) can 
depend now on point A, however, and must therefore be placed within the 
line integral. 

At this stage we might pause to note that we are here considering all 
particles to have the same "coupling constant'’ A to the mass field. It is easy 
to modify the above discussion to deal with a case in which particle a has 
coupling constant A a , particle b has coupling constant A 6 , and so on. We 
change the definitions (80), (81), to 


M (rel \X) = ^ \ a f G (re0 (X, A) da, (86) 

a 

M illM (X) = X A « / <j <adv) (A r , A) da, (87) 


and define the mass of particle a at point A by 
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( 88 ) 


Then 




2 / m aO> da 



continues to be given by (84), provided m(A) in (85) is replaced by m a (A). 

To obtain the gravitational equations, we have to express the second term 
on the righthand side of (84) in the form 


Qtk 8 g ik V-g d * x - ( 9 °) 

The gravitational equations, given again by putting 8S = 0 for arbitrary 
8g ik , are simply 



Qik + 


(mat)i/c 


0. 



The term involving is easily seen to give the contribution 


1 

2 


[M i {ret) M k 


(adv) 


+ A/ i (adv) M t . (ret) 




Writing 


S(RV^g) = S(R ik g ik ^g) 


= (*« - jgikR)sg ik V^g + g ik 8R ik V^g, (93) 

we have a further contribution 

£ ~ j gik *) A/ (ret) M (adv) , (94) 

the remaining term of (84) being 


— f $R ik g ik M iTet) M ia - dv) \/—g d 4 x. (95) 

12 Jy 

If we express (95) in the form 

j f &i k s g ik vT g d * x > 


(96) 
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the gravitational equations become 


TmatHfc + <*i k +U Ri «- Y ^ ^ M<ret,M<adV) 


- [M i (ret) M fc (adv) + M /t <ret, M i (adv) - g ik g PQ M q iret) M p {!liv) ] = 0. (97) 


2 


As in the Einstein theory, we begin the reduction of (95) with 


g i/c 8R ik 


1 


0 


dx 


[ V—g w,i ]> 


(98) 


where 


w 


i xri _ Q il zrk 

01 ik 6 01 ik- 


g 


(99) 


Using the divergence theorem we can therefore write (95) as 


1 



w 


i 


0 


12 Jy dX l 


j^(ret)^(adv)j JTt. 


( 100 ) 


And using the identities 


r k * 


i 


0 


ik 


V^g dx 




gi*r 


l 


1 


0 


ik 


V^g dx 


(V^gg lk ), ( 101 ) 


we see that 


w 


i 


$g 


ik 


W l ik + y SikS plrq pg ~ y gikg P9jr 


pq 


1 


0 


V-g dx 


T l S (g il V~g)] 


1 


V 


g 


g 


il 9 


0.X 


TiSy^g). ( 102 ) 


Substituting (102) in (100), and making straightforward reductions in which 
the divergence theorem is again used, we obtain 


1 


12 


f {g ik □[M <ret) M (adv) ] - [M (ret, M (adv) ]. u } Sg ik V^gd 4 x. (103) 


The gravitational equations therefore take the form 
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H R ik - \g ik R){M^M^\ + T (mmk 

— — [M i (ret) M fc (adv) + A/ fc (ret) A// adv) — g ik g pq M {ret) M (adv) ] 

~\{gik □[A/ (ret) A/ <adv) ] - [M (ret) M <adv) ] ;ifc } = 0. (104) 


It is ot interest to consider the contracted form of (104), 


\ 

6 


Pv[M (ret) M< adv) ] + 7J mat) + g pq M p ^M q ( adv) 


Jl 

2 


□[M (ret) M (adv) ] = 0. (105) 


Writing 


m(A) = \M(A) = y A[A/ ,ret, M) + M <adv> (^)] 

iw 


(106) 


in (85), 


T 


(mat) 


y A ^ / [A/ (ret> 0) + A/ ,adv) 64)] 

a 


[-sO )] 1/2 


da. 


(107) 


Furthermore 


□[M (ret) M (adv) ] 

= Af (ret) DM (adv) + Af (adv) DM (ret) + 2g pq M p iret) M q Udy) = 0, (108) 

so that the term in g pq M p irei) M q (a - dv) cancels from the lefthand side of (105). 
Moreover, the definitions (80), (81), of Af (ret) , M (adv> , taken with the fact 
that (7 (ret) , G (adv) , are the retarded and advanced solutions of (71), lead to 


□M< ret) (JO + j R(X)M {ret) (X) = \ 2 J - 


[ 


□M <adv) (A r ) + -- R(X)M <liM (X) = X 2/ 

a 


i 


« 4 (*U) ^ 

-I/O 9 

-£(^)] 1/2 

(109) 

S 4 (*»v4) ^ 

-7777 da. 

-g(A)] 1/2 

(110) 
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Hence the lefthand side of (105) vanishes identically. What can this result 
mean? 

The aim of the Einstein theory is to determine both the geometry, ex¬ 
pressed by the symmetric metric tensor g ik (x), and the trajectories of the 
particles. At first sight there might seem to be ten independent components 
of g ik at each point X required to determine the geometry. Yet the coordinate 
transformation x H = x /l (x) changes the g ik to g ik , where 



dx l dx m 
dx H dx' k Slm ’ 


( 111 ) 


and the tensor g ik determines the geometry just as well as g ik . Hence not 
all ten components of g ik are needed to determine the geometry—the ten 
components contain a redundancy which expresses our freedom to make 
coordinate transformations. In fact, only six independent functions are 
needed at each point X in order to specify the geometry. This is a question 
that we shall consider further later on. 

Yet the Einstein equations (69) at first sight seem to have ten independent 
components. These do not overconstrain the determination of g ik9 however, 
because of the identities 


ik R ) =0, (112) 

';k 

which require 

Tmat>2* = 0. (113) 

Equation (113) has four components that determine the trajectories of the 
particles. Thus the ten Einstein equations can be thought of as comprising 
six for the geometry and four for the particle trajectories. 

In the theory here our aim is not to determine a unique geometry. Because 
of the conformal invariance of the theory, we can determine the geometry 
only to within a conformal transformation. This reduces the number of 
functions needed at each point X from six to five. The ten components of 
(104) contain four conditions to determine the particle trajectories, five to 
determine the family of conformally equivalent geometries, and one identity. 
The identity appeared when we took the contracted form of (104). Thus 
we have explained the result we reached with (110). 

We come now to the critical question: Can we choose a particular geome¬ 
try belonging to our family of conformally equivalent geometries in which 
the gravitational equations (104) become the same as the Einstein equations? 
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Piovided we introduce a response condition” ot exactly the same form as 
that used in §5.2 for the electromagnetic energy-momentum, the answer 
to this question is yes. Let us follow the procedure of §5.2, simply replacing 

^ * T “a A aKadvl , by A/ <ret) , ,V/ ,advl . Thus consider the slab of spacetime 
Q < t < T. For any point X in the slab, we have 

M { re° = < ( < T + < 0 , 

A/ <adv) = M ,adv) 0 < * < j, + M <adv) ( >T . (114) 

As in §5.2, we suppose for simplicity that the retarded fields “start” at t = 0. 
This is expressed by putting A/ (re,) t < 0 = 0. For A/ <adv) ( > T we use a response 
condition similar to (52), 

>T = A/ <ret, 0 < t < T - A7 <adv) 0 < isr . (115) 

Then 


A/ <adv) = M (ret) = Af (ret) 0 <; < r , (116) 

and we can write (104) in the form 

K ( R ik -\gikP) = -3[TmatH fc + AJ + (&* - A ;tt ), (117) 

where 

A = 1 [A/<ret)]2 (U8) 

* 4 * = - + y g ik g pq M^ Tei) M q (iei \ (119) 

and where T imSLt}ik can be written in the form 


T 


(m at)i k 


M irel \A) 


8AX, A) da { da 


k 


[— g(A)] 1/2 da da 


a 


da , 


( 120 ) 


so that only A/ (ret) appears in (117). Now because the theory is conformally 
invariant, we can choose a particular conformal frame in which M {ret) (X) 
is independent of X, since the mass field M (ret) transforms as i? -1 in a 
conformal transformation. If M (ret) is not constant to begin with, we can 
evidently make M* (ret) constant by a conformal transformation with 
S2 oc M {ret) . Then (117) simplifies to 





122 
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K(R ik -jg ik R) = -^T (nmm , (121) 

which has the form of the Einstein equations. Moreover, the particle masses 
AM ret are now constant, as in Einstein’s theory. Indeed, the only difference 
from Einstein’s theory is that in this theory we have an explicit determination 
of the gravitational constant. Thus defining 

8 77 <7 = 3/K, (122) 

the equations (121) become the same as (69). Since M iret) is in principle 
calculable for any given cosmological model, K, and hence G, is calculable. 
We shall consider such calculations for various models in the next chapter. 
Here we simply notice that the definition (118) requires AT, and hence G, 
to be positive. It follows that we have deduced that gravitation must be 
attractive, whereas in the Einstein theory the assumption G > 0 was intro¬ 
duced ad hoc. 

The appearance of A in (120) does not mean that the value of this coupling 
constant affects a purely gravitational theory. Since M irei) is also proportional 
to A, every term in (117) has a A 2 dependence, so that A 2 could be canceled 
throughout (117): If, however, we were also to include an electromagnetic 
field, with (117) changing to 

— ~2 — ~3[7(matK/c + T {em)ik + @ ik ] 

+ (g ik HK - K. ik \ (123) 

we would have A 2 in all terms except T (em)ik . The value given to A 2 then 
affects the relative importance of T {em)ik to the gravitational field: the smaller 
A 2 is, the more important the electromagnetic field becomes. Also in a re¬ 
duction to the Einstein form 

^ (^i/c — ygi/c^ — — ^[^(mat )ik + ^(em )ik ]. (124) 

with K now constant, we have 

1W + w* = °> (125) 

and the value attributed to A 2 affects the orbits of charged particles. 

The discussion here has shown that the concept of a “response of the 
universe” has application outside electrodynamics. Later we shall find a 
further example in which a response condition plays a crucial role. All these 
response conditions are structurally similar to each other, just as (115) is 
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similar to (52) of Chapter 5. Both conditions are classical, however, and just 
as the quantum-mechanical response of the universe, given by (124) of 
Chapter 4, contains subtleties not present in (52) of Chapter 5, so we should 
expect a quantum form of (115) to open out a range of phenomena analo¬ 
gous in richness to quantum electrodynamics. In this book we shall not 
attempt anything as elaborate as a full quantum treatment of inertia and 
gravitation, although we shall take up some questions preliminary to such 
a treatment in Chapter 8. Before coming to these questions, we proceed 
in Chapter 7 to apply the classical development we have reached to well- 
known cosmological models. 


Equations of Motion 


The usual procedure for obtaining the equations of motion of the particles 
is to go back to the action (70) and to vary the particle paths individually, 
requiring 8S = 0 for such variations. However, this procedure obscures the 
fact that the equations of motion are contained within the gravitational 
equations. Here we shall give a direct derivation from the gravitational 
equations, using first the conformal frame in which the particle masses m a , 
ra 6 ,..., are constant. In this frame, and in a purely gravitational field, 


T ik 

1 (mat);/c 



(126) 


is the required equation. Here 


T {m J\X) = 2 nr a 

a 


S 4 (X, A) da' da k 
V — g(A) da da 



(127) 


so that 


dT ik 

dx k 




1 da { da k 


da 


da da 


2 


m 



a 


a 


da 


[ W 


1 


da 1 da k 


V— g(A ) da da 


da 


d 


2 m a J — IV*- A)] 


1 


da 


a 


da 


V-g(A) da 


da 


2 f b= 


da 


a 


da ' V— g(A) da 


da , 


(128) 

























6 INERTIA AND GRAVITATION TREATED CLASSICALLY 


1 24 


since the particle paths are not considered to start or to end at the point 


X. Thus 


T ik . 


d T ik 


;k 


dx 


+ r 


k ' ‘ kl 


T lk + r k kl T il 


= 2 


m 



a 


S 4 (X, A ) 

V-g(A) 


f -— d / 1 


da 


da ' V —g(A) da 


i da k da ri k da da 
kl da da kl da da _ 


da , (129) 


and (126) requires 


V 


g 


d 


/ 1 da \ p i da k da ^ ^ p k 

da ' V —g da * 


da 1 da 


i 


da da 


Ik 


da da 


0 . (130) 


Using 


r 


k 


1 


0 


kl 


V—g dx 


l 


V 


g > 


(131) 


(130) simplifies to the usual geodesic equations 


d 2 a i pi da k da 1 _ q 


da 


kl 


da da 


(132) 


To obtain the equations of motion in a general conformal frame, consider 


the effect on (132) of the transformation 


gik 


® 2 gik 


Since 


(133) 


da 


£2 da , 


(134) 


and 


r 


kl 


n . / *■)« * 0 In t (2 

r ki + U in g M - 


dx n 


*i 0 In $2 

g k 


dx 


i 


*i 0 In J 2 

g V 


0.X 


k 


), (135) 


we have 


1 d (£2 da 


L2 da 


da 


q. r*' 


da k dcd 
kl ~da* ~da* 


+ \g* in g k i * 


0 In Q 

0JC n 


*7 0 In i $2 

g V 


dx 


i 


g i 


0 In £2 \ 

i da k da 1 

dx k ) 

* da* da* 


0 (136) 
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for the equations of motion in the starred geometry. The transformation 
function $2 can be replaced by m a /m a *, with m a constant, so that 




da k da 1 

~ckf~d^ 





g* in gki* 



da k da 1 
da* da* 



(137) 


This simplifies to 



+ m 


a 



da k da 1 
da* da* 



(138) 


Thus in a general conformal frame the equations of motion are not geodesic. 

It is possible to obtain (138) directly by dividing (117) through by K , and 
then taking the divergence. The reduction is somewhat awkward, however, 
it being necessary to use the wave equation for the masses, the relation (41) 
of Chapter 2, and also (117) itself. 


















COSMOLOGY 



THE EINSTEIN-DE SITTER MODEL 
(LECTURE 17) 


The Friedmann models are usually discussed with respect to the Robertson- 
Walker line element 




+ r 2 (d0 2 + sin 2 # d<t> 2 ) 



The spaces k = 0, ±1, are each conformal to Minkowski space. Because 
our gravitational theory is conformally invariant, we can evidently consider 
the Friedmann cosmologies in Minkowski space, provided we take care to 
transform the masses of particles according tom* = Q~ x m. The masses are 
taken as constants with respect to the line element (1), m — m 0 , say, so that 
(1) gives the conformal frame in which the gravitational equations reduce 
to the Einstein form. In this chapter we shall consider the three spaces, k = 0, 
±1, in turn. 

For k — 0, we have the Einstein-de Sitter model, with 


ds 2 = dt 2 — Q~(t)[dr 2 4- r 2 (d0 2 + sin 2 0 d<j> 2 )]. 


(2) 
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The line element (2) is easily seen to be conformal to Minkowski space by 
defining a new time-coordinate 


giving 



ds 2 = Q\t)[dr 2 - dr 2 - r 2 (dd 2 + sin 2 6 dcj> 2 )]. 
Choose the conformal function to be 




a = 0 _1 . (5) 

The corresponding starred geometry is then 

ds* 2 = dr 2 — dr 2 — r 2 (d6 2 + sin 2 # d<j> 2 ), ( 6 ) 

and the particle masses are 

m * = Q m o- (7) 

Write n for the particle density in the original Robertson-Walker frame. The 
corresponding density n* in the Minkowski frame is 

n* — f2~ 3 n — Q 3 n — constant, ( 8 ) 

the product Q 3 n being constant in all Friedmann cosmologies. 

The particles have spatial coordinates r, #, <£, independent of t in the 
Robertson-Walker frame, and hence independent of t in the Minkowski 
frame. Writing L~ 3 for the constant density in the Minkowski frame, we 
have 



The Einstein-de Sitter cosmology expressed in Minkowski space is simply 
a uniform static distribution of particles. 

Next we determine Q in terms of r, so that (7) can be used to give the 
time-dependence in Minkowski space of the particle masses. The gravita¬ 
tional equations in Robertson-Walker space (i.e., the Einstein equations) give 



87 tG 

~T~ 


<7, 


3 


( 10 ) 
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where Q = dQ/dt and o is the mass density. Writing o = nm 0 and remem¬ 
bering the definition 

8-77(7 = 6A 2 /m^, (11) 


(10) can be written in the form 


2 A 2 nQ 3 _ /2A 2 L" 3 \ 
m 0 Q \ m 0 ) Q' 



the first factor on the righthand side of (12) being a constant. Integrating 
(12) subject to Q — 0 at t = 0 gives 



2A 2 Zr 3 





Substituting (13) in (3) then leads to the following relation between r and t; 



Replacing t by t in (13), we have 



1 A 2 L~ 3 

2 m 0 




whence 


m*(r) = yA 2 L 3 r 2 . (16) 

The masses in the Minkowski frame are quadratically dependent on the 
time. 

Since we are working with a conformally invariant theory, all relations 
between theory and observation in the Minkowski frame must be the same 
as in the Robertson-Walker frame. Consider the Hubble redshift-magnitude 
relation. 

By a standard galaxy we mean one with a specified number of stars, each 
with the same number of particles of the same kind. Light from a galaxy 
with coordinate r must start its journey at time r — r in order to reach an 
observer at the spacetime point r — 0 at time t. Write L*(r — r) for the 
intrinsic luminosity of the galaxy at that time. The observed flux is just the 
Euclidean value 
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(17) 


Now the intrinsic luminosities of standard galaxies are usually taken to have 


the same value, say, L, in the Robertson-Walker frame, where the particle 
masses are also the same. In the Minkowski frame, on the other hand, we 
must have 


L \r - r) = [S2(t - r)]~ 2 L = Q \r - r)L, 


(18) 


because “luminosity” being “energy per unit time” behaves in the same way 


as (mass) 2 under a conformal transformation. From (15) we therefore see 
that the observed flux from a galaxy at distance r depends on r according to 



(19) 


Since the redshift 1 + z is given by 

m\ t) 



( 20 ) 


m*(r - r) 


it follows, by eliminating r between (19) and (20), that the observed flux 
depends on z according to 


( 21 ) 


1 + z (Vl + z - l) 2 


which is the usual Hubble relation for the Einstein-de Sitter model. 

The dependence (16) of mass on time was obtained above by starting 
with m = iyIq — constant in the Robertson-Walker frame. Can we also obtain 
(16) by direct calculation? In the Minkowski frame, the time-symmetric mass 
propagator takes the simple form 


G(X, B) 



( 22 ) 


while the fully retarded mass propagator is 


G Tet (X, B) = ^ S(t x -t B - \x x - -v s |). 


(23) 


Considering the retarded mass field generated by (23), and accepting that 
the universe came into being at t — 0 in the Robertson-Walker frame, and 
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hence at r = 0 in the Minkowski frame, only particles at distances 
contribute to m*^ Tei \X). The contribution of such a particle at distance r 

is simply 


A 2 

4-777* ’ 



and the total contribution for a uniform distribution of particles of density 

Lr 3 is 


A 2 L 


-3 


2nr 


477 


f sin 0 dO f d<f> f r dr 

*7) *0 *7) 


1 

2 


A 3 L~ 3 t 2 


(25) 


which is the same as (16). Hence (16) can be obtained by direct calculation. 


The Space for k — + 1 


Defining sin R — r, we have 


ds 2 = dt 2 


QHt) [ T 


dr 2 


+ r 2 (d6 2 + sin 2 6> d<t> 2 ) 1 


dt 


Q 2 (t)[dR 2 -f sin 2 R(d0 2 + sin 2 ^ d<$> 2 )]- 


(26) 


The coordinate r is to be regarded as dimensionless, whereas Q has dimen¬ 
sionality C (see §5.1). From the Einstein equation we get 


Q 2 1 
- + 


Q 


Q 


877G 

T" 


a 


877G 

“T” 


nm 0 , 


(27) 


where n(t) is the particle density. Noting that the dimensionless quantity 
nQ 3 is independent of /, we can write 


Q 


a 


Q 


Q 


(28) 


where the constant a with dimensionality C is given by 


a 


877(7 2A 2 

—— m 0 • nQ 6 — - nQ 6 . 

3 m 


0 


(29) 


Equation (28) can be solved in terms of a dimensionless quantity T, defined 
by 
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to give 



Q = j( 1 - cos T), 
t = j(T-sin T). 


In terms of T.\ (26) becomes 

ds 2 = Q 2 [dT 2 - dR 2 - sin 2 R(d0 2 + sin 2 0 d<$> 2 )). 
Next it can be verified that the coordinate transformations 


£ = y (T+ R), 


X = tan i, 2t = X 4- Y, 


t, =1(7- R), 


Y = tan tj, 2 p = X - Y, 



(31) 

(32) 




map the interior of the triangle formed in the R , T, plane by the points 
(0, 0), (77, 0), (0, 77 ), into the quadrant 0 < p < oo, 0 < r < oo, in the p, r 
plane. By carrying through these transformations, it can also be shown that 



4 Q 2 

(1 + X 2 )(\ + Y 2 ) [d 



- p 2 (d$ 2 + sin 2 # d4> 2 )\. 



Therefore the conformal transformation 

i? = C( 1 + X 2 ) 1/2 (\ + Y 2 ) 1/2 /2Q (36) 

maps the original spherical space into half of the Minkowski space 

ds* 2 = C 2 [dr 2 - dp 2 - p 2 (d0 2 + sin 2 ^ J<f> 2 )]; 0 < r < oo. (37) 


To keep track of dimensionahties, the basic unit C has been introduced into 
(36), so that £2 is dimensionless. Numerically, C is unity. The factor C 2 
appears on the righthand side of (37) because r, p, are dimensionless. 
Since the total mass m — m 0 was constant in the Robertson-Walker frame, 
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the mass m 



* is variable in the Minkowski frame and is given by 

2 Qm C -1 

(1 + X 2 ) 1/2 (\ + Y 2 ) 1/2 

[1 + (t + p) 2 ] 1/2 [ 1 + (r - p) 2 ] 1/2 + T 2 - p 2 - 1 ^ £- 

[1 + (r + p) 2 ][l + (t - p) 2 ] 



The particle trajectories are R = constant in the Robertson-Walker frame 
and are given by 


--^^ = constant (A \ say), (39) 

1 + t 2 - p 2 

in the Minkowski frame. In the Robertson-Walker frame, the particle density 
is independent of R. In the Minkowski frame, on the other hand, the density 
depends on both p and t. At r = 0, it has the form 

(1 + p 2 ) -3 . (40) 

Thereafter it changes according to the trajectories (39). In terms of the 
constant A , the radial coordinate along a trajectory is 

p = (A 2 + t 2 -f 1) 1/2 — A, (41) 

from which we see that dp /dr = 0 at t = 0 , and that p —^ r as r — >- oo. 

How shall we interpret these results? At r = 0 we have a spherically 
symmetric cloud with the radial density distribution (40). Initially, at t = 0, 
the cloud is at rest, but as r increases, the cloud moves radially outwards 
and ultimately dissipates itself. It does so, not because of gravitational forces, 
which are zero in this frame, but because dm* /3r, dm*/dp, are nonzero, 
and these mass-gradient components appear in the equations of motion of 
the particles, i.e., in equations (138) of Chapter 6. 

It would be possible to derive (38) by direct calculation. Likewise it would 
be possible to derive the usual Hubble redshift-magnitude relation for the 
case k = + 1, just as we did for the Einstein-de Sitter model. 

It follows from (38) that m* is zero at r = 0, increases to a maximum 
as r increases, and then again tends to zero as r oo. This result means 
that m* is initially zero because mass interactions are restricted to particles 
at distances r < r, which gives m* — 0 at t = 0, and that m* 0 as 
t oo because the finite cloud dissipates itself and the mass interactions 
tend to zero. 
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It comes as no little surprise to find the cloud dispersing to infinity. In 
the Robertson-Walker frame, we think of the universe in the k = +1 case 
as expanding from a singularity to a maximum of the function Q{t) and 
then collapsing back into a singularity. We think of the collapse as being 
a reverse of the expansion, expansion and contraction being determined by 
the two roots of (27). How does this reverse express itself in Minkowski 
space? 

From (31) we see that Q attains its maximum value for T — tt. Yet the 
dispersal of the cloud to infinity in the Minkowski frame corresponds to 
the behavior of the system in the triangle (0,0), ( 77 ,0 ), (0, 77 ), in the T\ R , 
plane; so the dispersal corresponds only to the expansion phase in the 
Robertson-Walker frame. To obtain the contraction phase in which the 
dispersed cloud at infinity falls back to its original state at r = 0, one requires 
the range 77 to 2tt for the T coordinate. The triangles in the T, R , plane 
with this range, and which map into the upper half of the Minkowski plane, 
are 


( 77,77 + 2/777), (277 ,77 + 2wr), ( 77 , 2<tt + 2/777), (42) 

with n any integer. The periodicity of R with respect to 77 is not unexpected, 
but the addition of 77 to R, compared to the expansion phase, in which the 

possible triangles in the T, R, plane with 0 < T < 77 are 

(0,2/777), (77, 2/777), (0, 77 + 2/777), (43) 

requires a change of sign in the relation between R and the original r 

coordinate, to r = —sin R. The metric (26) is clearly unchanged when 77 

is added to R. 

The corresponding triangles going into the lower half of the Minkowski 
plane are 

(0, 77 + 2/777), (77, 77 + 2/777), (77, 277 + 2/777), (44) 

for the range 0 to 77 for T, and 

( 77 , 2 / 777 ), (277, 2/777), (277, 77 + 2/777), (45) 

for 77 < T < 277. The situation for the triangles (42), (43), (44), (45), is shown 

in Figure 7.1. In terms of the original /-coordinate, the time required for 
the dispersal of the cloud—i.e., the expansion phase in the Robertson-Walker 
frame—is given by putting T — 77 in (32), and is ^rra. 
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FIGURE 7.1 

The triangles marked U are mapped into the upper half of the Minkowski plane, 
those marked L into the lower half. The blank triangles have no mapping into the 
Minkowski plane, since they correspond to p < 0. The diagram repeats itself with 
periodicity Iv with respect to both T and R. 


The Space for k — — 1 

Define sinh R — r, so that the metric (1) for this space can be written in 
the form 


ds 2 = dt 2 - Q\t)[dR 2 + sinh 2 R(d0 2 + sin 2 0 d<j> 2 )]. (46) 

From the gravitational equations we have 



with a defined as in (29). Once again (47) can be solved in terms of a 
dimensionless T defined by (30), to give 


0 = y(cOShr - 1), 
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t = y (sinh T - T). (49) 

The appropriate coordinate transformations differ from (34) in the defini¬ 
tions o! X , Y , 


X — tanh £, Y = tanh 17, (50) 

but are otherwise the same. The coordinate transformations now map the 
quadrant 0 < /? < 00, 0 < T< 00, in the R , T\ plane into the interior of 
the triangle (0, 0), (1,0), (0, 1), in the p, r, plane. Corresponding to (35) we 
have 


40 2 

ds 2 = - "_ Y-) ^ _ rfp2 _ p2( ~ de2 + sin2d d<t>2 ^' 

The function required for conformal transformation to Minkowski space is 
therefore 


42 - [C(l - X 2 ) 1/2 (\ - Y 2 ) 1/2 ]/2Q, 


(52) 


and the resulting starred geometry is 


ds 


*2 


C 2 [d 


dp 


p 2 (d6 2 + sin 2 # d<$> 2 )\. 


(53) 


Corresponding to (38), (39), we have 


m 


2 QC 


-1 


m 


(1 - X 2 ) 1/2 (\ - Y 2 ) 1/2 


1 + t 2 — p 2 — [1 — (t -f- p) 2 ] 1/2 [l — (r — p) 2 ] 1/2 

[1 - (T + P) 2 ][l - (T - P) 2 ] 


(54) 


aC 1 m, (55) 


2p 


1 + P 


2 _ r 2 


constant 


(A \ say), 


(56) 


and corresponding to (40), (41), we have 


(1 - P 2 )- 3 , 


(57) 


p = A — (A 2 4 - t 


1) 


1/2 


(58) 


We again have dp/dr = 0 at r = 0, but the radial coordinate p now tends 
to zero as t — *- 1. 
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The k — — 1 Friedmann model is therefore to be interpreted as an initially 
stationary cloud confined to p < 1. It collapses to zero radius at r = 1, and 
it does so because of an initial singularity in the density of matter (57) as 
the radial coordinate approaches 1. Once again the collapse occurs because 
of mass-gradient components in the equations of motion (138) of Chap¬ 
ter 6. 

§7.2 THE ORIGIN OF THE UNIVERSE 

Throughout the preceding discussion, we took as our starting point the fact 
that the various Friedmann models are known to satisfy the gravitational 
equations. The latter are usually considered in terms of a smooth-fluid 
approximation in which a density function p(X) is defined by 

p(X) = f V A )- m(A) da, (59) 

unit [3] a V“ g(A) 
proper volume 

it being assumed that the length unit is large enough for many terms in 
2 a to contribute to p. In a general conformal frame, the gravitational 
equations are given by (117) of Chapter 6. For the line element (1), and 
for the Einstein-de Sitter model, these equations lead to two independent 
conditions 



Q 2 

2 P 

m 2 

2 Q 

m 

(60) 


Q 2 

m 2 

o 

' Q 

m ’ 

• • 

2 Q 

Q 2 


4 m Q 

2 m 

' 9 

(61) 

Q 

4 - 

Q 2 

— 

m Q 

m 

' + 2 ’ 


where the dots denote differentiation with respect to time, and where for 
convenience we are putting the coupling constant A = 1. Because of the 
spatial homogeneity of the model, only the time derivatives of the mass 
function m appear in the smooth-fluid approximation. 

In the conformal frame in which m is constant, (61) leads to 0(t ) oc / 2/3 , 
whereas in the conformal frame in which Q — 1 at all times, the same 
equation gives m oc T L> . We have seen above how this latter proportionality 
can be obtained in a smooth-fluid calculation. We end the present chapter 
by considering m oc r 2 in relation to the discrete-particle picture in which 


m(X) =2/ G (ret) (A r , A) da. 


(62) 
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The particles are static in the Minkowski conformal frame and 


G (ret) (A, A) 






Thus we can write 


m(X) 







and 



1 


2 


1 


477 ‘—' \X Y — X A 
a 1 * A 



d 


dr 




The particle paths are considered to go from 


1 


m 




1 


477 "—' \X Y — X A 

a 1 a A 


S(Ty ~ 



r = 0 to t = oo, so that 
\ x x - x a\X ( 66 ) 


the delta function being zero for r A —oo. If the paths were considered 
to go from —oo to oo, the delta functions would be zero at both “ends,” 
and m would be zero. Although (61) would still be satisfied, (60) would then 
lead to p = 0. Hence we see that it is the broken ends of the paths at r = 0 
which permit a nontrivial model to exist. Writing r — \x x — x A \, dropping 
the subscript X , and evaluating (66) in integral form, we have 

oo 

m( t) = n I S(r — r)r dr = m , (67) 

J o 

where n is the proper density of the particles. 1 This is the same result for 
m as before. 

The fact that broken ends thus play a critical role in the Einstein-de Sitter 
model, and similarly for the models with k = ±1, means that the usual 
forms of the Friedmann cosmologies are not in principle different from the 


1 Owing to the sense of the discussion here, we have dropped the starred notation for the 
Minkowski frame. The n used here was written earlier in the chapter as /?* and m was written 
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steady-state model in which broken paths represent particle creation. It is 
true that broken ends appear only at r = 0 for the Friedmann models, 
whereas broken ends appear at all times in the steady-state model, but this 
difference is technical rather than one of principle. 

However, in Chapter 9 we shall see how to avoid broken paths entirely, 
and we shall find that the concept of an “origin” for the universe can be 
replaced by a new form of symmetry. 


PROBLEMS IN 
THE QUANTIZATION OF 
INERTIA AND GRAVITATION 


§8.1. THE FREE-PARTICLE PROPAGATOR 

(LECTURE 18) 


In §4.6 we saw how to build the propagator for free Dirac particles from 
the infinitesimal propagator 


K(X + Q;X) = ± 

7 T 




where q { is the coordinate displacement between point X + Q and point 
X,q 2 — q t q\ and q — y i qr i , y i being the Dirac matrices. For a path r + going 
forward in time from point 1 to point 2, define an amplitude by 


N 

P(r + ) = lim n A *(ft. Or- x). (2) 

T—l 


as in (88) of Chapter 40. Here we have identified the point Q 0 with point 
1 and Q n with point 2; Q v ..., Q N _ 1 are intermediate points chosen so that 
the steps Q r _ 1 to Q r are small. With a suitable choice for the measure factors 
A r , the path integral 
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gives 


K 0 + (2; 1) = 0(t 2 - /j) V u n (2) u n (\) 


n 


i 


2 7T 


6{t. 


*i)(f 2 - im ) 


5 ( 5 21 2 ) 


m 


2s 


0(s 21 2 )J 1 (ms 21 ) 


21 


(4) 


where u n are the free-particle solutions of the Dirac equation. The same 
infinitesimal propagator applied to paths going backward in time gives 


K 0 -(2- 1) = -9{t x - / 2 ) 2 u n { 2) u n (\) 

n 

= 27^ - a- 

The situation for the path integrals leading to (4), (5), is different from 
the nonrelativistic case in several respects. The infinitesimal propagator is 
now a 4 x 4 matrix. The product in (2) is a chain of 4 X 4 matrices in which 
adjacent members are in general noncommutative. In the nonrelativistic 
case, on the other hand, we have exp(idS) for the amplitude associated with 
a path element of action dS. The elementary amplitudes for a chain of such 
steps multiply as numbers to give exp(fS') for the finite path amplitude, 
leading to 


to + im) 


S(V) 


m 


0(S2 1 -)J 1 ( m S 21 ) 


2 s 


21 


(5) 


f exp [is(r 21 )yi>r 21 (6) 

for the nonrelativistic propagator. In this case the amplitude for a finite path 
has the same form as that for an infinitesimal step, namely, an exponential 
with the action as its phase. This property is lost in the relativistic case, 
in the sense that the finite propagator, whether /f 0 + , K 0 ~, or + K 0 ~), 

is diff erent from the infinitesimal propagator, not only in appearance, but 
in the important physical respect that, whereas the infinitesimal propagator 
from point 1 is zero except on the light cone of point 1, the finite propagator 
is nonzero off this light cone. The Bessel function J x in (4), (5), is nonzero 
inside the future and past light cones of point 1, respectively. It is interesting 

to consider this difference in terms of the Dirac equation for the finite 
propagator, 
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(f 2 + im)K( 2; 1) = S 4 (2; 1), (7) 

rather than in terms of path integrals. 

All forms of the finite propagator, K 0 + , K 0 ~, %(K 0 + + or 

(a propagator like K + , but with the reversed time sense) satisfy (7). Hence 
(7) alone does not specify the propagator uniquely. The situation here is 
analogous to the fact that the potential in electromagnetic theory is not 
determined uniquely by the wave equation. 

Let K (0) (2; 1) be chosen to satisfy 



P 2 K< 0> (2;1) = S 4 (2,1), 

(8) 

and let K {n \ 2; 1) be chosen so that 




f 2 K {n) ( 2; 1) = 

-im K in ~ 1) (2; 1). 

(9) 

Then 

K( 2; 1) = 

00 

2 K M (2 ; 1) 

n — 0 

(10) 

satisfies (7). Since 

1(2; 1 ) = 

= L ^ 

(ID 

has the property 

r 2 * r 2 /(2; 1) = 

□7(2; 1) = 5 4 (2, 1), 

(12) 

it is clear that the choice 




K i0) ( 2; 1) 

= f 2 /(2; 1) 

(13) 


meets the requirement ( 8 ). Comparing ( 8 ) and (9), we see that K is a 
Green s function for the determination of K^ n) in terms of K^ n 


K m ( 2; 1) = -imj K m (2; 3)K in ~ 1) (3; I) dr 3 , (14) 

the integral being over all spacetime. Reducing K (n ~ 0 in the same way, 
and so on, we get 

K m (2\ 1) = (— im) n f K (0> (2; 2 + n)K <0) (2 + n; 1 + n)... 


K {0) ( 3; \)dT 3 ...dr 2 + n 


(15) 
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The choice (11) for 7(2; 1), being time-symmetric, leads to the symmetric 
form 


1[K+(2;1) + Ko(2;1)] (16) 

for K( 2; 1). If in (11) we had used 6 + (5 , 21 2 ) instead of 8(s 21 2 ), the same 
equations (12) to (15) would have given 

K( 2; 1) = K + ( 2; 1), (17) 

and 5 _(*s 21 2 ) in (11) would have given K_. 

For n — 1 we could already have chosen 

K {1 \ 2; 1) = — im 1(2; 1), (18) 

since (9) for n — 1 is then satisfied identically. The form (18) can also be 
obtained from (14). Thus 

K il) (2; 1) = -im f K i0) (2;3)K {0) (3; \)dr 3 = -im J f 2 7(2; 3) f 3 /(3; 1 )dr 3 

= im f f 3 7 ( 2 ; 3) f a 7(3; l)dr 3 = -im J 7(2; 3) D7(3; 1 ) dr 3 

= —im 7(2; 1), (19) 

a surface integral at infinity being set equal to zero. 

Except for a factor \ [which leads to the factor i in (16)], the first two 
terms of ( 10 ) are the same as the two terms in the infinitesimal propagator 
(1). The first two terms of (10) contribute only when the displacement from 
point 1 to point 2 is null, but higher-order terms contribute for nonnull 
displacements. It is these higher-order terms in (10) that generate the Bessel 
functions in (4), (5). Indeed, if the Bessel functions are expanded in powers 
of m, the terms are twice the corresponding powers of m in ( 10 ). 

We can think of (15) as a summation of all paths with n corners. The 
particle goes from point 1 to point 3 at < 7 t 3 , then to point 4 at Jr 4 ,. . ., to 
point 2 + n at dr 2 + ni and finally to point 2. The segments of such a path 
are all null. Since K {0) is the propagator for a massless spin-i particle (we 
can call such a particle a neutrino), we see that a particle with mass m going 
from 1 to 2 typically goes from 1 to some point 3 like a neutrino. At 3, 
unlike a neutrino, it is scattered with amplitude — im, whereon it proceeds 
like a neutrino to point 4, and so on until it arrives at 2. At all the points 
3,4,..., 2 + /?, it experiences scattering, as is shown in Figure 8.1. Evidently 
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FIGURE 8.1 



At all the points 3, 4, 5, ..., 2 + n, the particle experiences scattering 
with amplitude —im. 


we have a situation analogous to the perturbation expansion of electro¬ 
dynamics, the terms in (10) giving the different orders of the expansion. 
In place of the electric charge, we now have the mass appearing in the 
scattering amplitude. 

This description of the free-particle propagator suggests that the phe¬ 
nomenon of “mass’" has the character of a scattering field, a concept which 
agrees with the classical picture developed in the preceding chapters. To 
be consistent with the point of view developed for electrodynamics, we 
require the symmetric form (16) to represent the local propagator, not the 
time-asymmetric forms K 0 + , K 0 ~, K + , or K_. The above work shows that 
(16) can be represented by the expansion (10) with K {{)) given by (11), (13), 
and with K (n \ n > 1, given by (15). Since the integrations Jr 3 , dr 4 ,..., in 
(15) are over all spacetime, there is no restriction concerning time-reversals 
on the paths which appear in (15). However, the integrations give rise to 
extensive cancelations among the paths. The forms (4), (5), for K 0 + y K 0 ~, 
show that (16) is zero when point 2 is outside the light cone of point i, 
a property which develops when the integrations are carried through explic¬ 
itly. Indeed, the paths of free particles with time-reversals are self-canceling, 
a property with which we are familiar from Chapter 4. This same property 
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is shown by the fact that K 0 + can be obtained from a path integral with 
respect to T+ paths, and K 0 ~ can be obtained with r~ paths—no time- 
reversals occurring in either case. 

We come now to a critical issue. The integrations over dr 3 , dr 4 , . . ., in 
(15) cover all spacetime. For this situation to be correct, we have to be sure 
the '‘mass field” m is everywhere the same throughout the universe, and 
also that the particle in question is not subject to a process in which it is 
changed into some other kind of particle. Since we cannot be sure of these 
things, we proceed by restricting the integrations over dr 3 , dr 4 , ..., to a finite 
region, say, V, of spacetime which is nevertheless large compared to the local 
system in question. The integrations cover all paths that go from 1 to 2 
without leaving V. However, there will now be paths from 1 that go outside 
V. Those paths that reach 2 as a consequence of paths going outside V from 
1 lead to what we might call a "response of the universe,” in analogy to 
the situation in electrodynamics. The response can be written quite generally 
as the sum of a part which extends the integrations dr 3 , dr 4 ,. .., in (15) 
to cover the whole of spacetime, so that \ • (K 0 + + K 0 ~) again forms a part 
of the total propagator, and an as-yet unknown response propagator, say, 
k( 2; 1), satisfying the homogeneous Dirac equation 

(f 2 + i m ) k(2; 1) = 0 (20) 


inside V We have 

K( 2; 1) = j(V( 2; 1) + A 0 "(2; 1)] + <2; 1). (21) 

We carry the analogy with electrodynamics still further by adopting the 
notation: 


K( 2; 1) 


ret 


A 0 + ( 2; 1 ) = en. 


h) 


/F u (2 )u ( 1) T- ^ u(2 )ll (1) 


E n >0 


E„< 0 


n 


A'(2; l) ret+ + K( 2; 11. , 


( 22 ) 


K( 2 ; 1 ) 


adv 


A 0 -(2; 1) 


0(t 1 t 2 ) 


7 , u(2)u(\) + ^ u(2)u(\) 


E„> 0 


71 


E n <0 


= A(2; 1 ) adv+ + A'(2; 1 ) adv _. 


(23) 


Here we have simply separated K 0 + , K 0 ~, into positive- and negative- 
frequency parts. We now take 
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k(2; 1) = j [( K let+ - A adv+ ) + (A' adv _ - A' ret _)]. (24) 

Again notationally, if we define a time-inverted propagator K' by 

^ ret — ^adv’ K adv = A ret , (25) 

so that (24) can be written as 

k(2, 1) = — [A ret+ — A adv+ ) + (A' ret _ — A' adv _)], (26) 

we have exactly the same form as the electrodynamic response condition 
given in (124) ot Chapter 4. [In the latter condition, the potentials A i come 
from paths and A\ come from conjugate paths. In the electromagnetic action, 
the line integrals involving A { and A\ have opposite signs, which implies 
opposite time-sense.] 

Substituting in (24) trom the identifications in (22), (23), we get immedi¬ 
ately 


k(2; 1 ) 


1 

2 


2 u n (2)u n (l) - V «„(2)m b (1) 


E„> 0 

n 


E n< 0 


(27) 


irrespective of the time-order of points 1 and 2. Inserting (27) in (21) then 
gives the important result 


K( 2; 1) = K + ( 2; 1). (28) 

Our point of view is that, although \(K 0 + -(- K 0 ~) is the local propagator 
for free Dirac particles, the response of the universe added to the local 
propagator gives K + . This result now replaces the hypothesis of Chapter 4 
in which we assumed the positive-energy states of a free particle to be propa¬ 
gated by Kf and the negative-energy states by K 0 ~. 

The propagator K + ( 2; 1) falls off as exp( — ms 21 ) outside the light cone 
of point 1. This extension of K + outside the light cone is usually regarded 
as a small effect. But with path integrals, we are concerned with steps much 
smaller than m~ l . Indeed, we have to think of s 21 — as a very large 
distance. The K + propagator thus permits a particle to arrive at point 2 even 
though the displacement from 1 to 2 is grossly spacelike. Unless we abandon 
our physical concepts in a very major respect, the particle cannot have come 
directly from point 1. It must have arrived via the universe. 

As an aside, it is worth noting that the perturbation expansion (97) of 
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Chapter 4 is obtained in Appendix 3 by a method which uses AT 0 + as the 
local propagator for the forward time-sense, and K 0 ~ for the backward 
time-sense. It is not hard to show that \{K {) + -f K 0 ~) leads to the same 
expansion. However, one must not use K + in calculating such perturbations, 
since only local paths are involved in a local electrodynamic perturba¬ 
tion—the use of K + would involve electromagnetic integrals over paths going 
far away into the distant universe. The propagator K + must only be used 
in such a calculation when the particle is free. 


§8.2. MASS INTERACTIONS IN QUANTUM 

MECHANICS (LECTURE 19) 


To the first order in the mass, we can write the infinitesimal propagator (1) 
in the form 


K(X+ Q\X) 


4- q S’(q-) exp (y imq 


1 


1 


2T eXp \ 2 lm 


)p9% 2 ), 



since to this order 


) = 1 + j imq, (30) 

and q 2 8'(q 2 ) = —8(q 2 ). The appearance of the exponential factor in (29) 
is interesting in relation to the Schrodinger theory, where the amplitude for 

a path f + A 2 of particle a from A to A is 




to sufficient accuracy, with t = 0 at point A and t = T> 0 at A. This 
amplitude cannot be used for Dirac particles because it does not contain 
spin. It is easy, however, to generalize the lefthand side of (31) to a 4 x 4 
matrix. Working in Minkowski space, from 


da 2 — ri ik da 1 da k , 
y%y k + Yfc Yi = 2 / tj **., 


(32) 
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where y 4 are the Dirac matrices and / is the unit 4x4 matrix, it follows 
that 


/ * da 2 = (y i da 1 ) 2 . 

This relation suggests the generalization to 


(33) 




detail that (29) contains a factor i which does not appear in such an at¬ 
tempted generalization of the Schrodinger theory. The major difference lies, 
however, in the ^ 8(q 2 ) factor in (29), which does not appear at all in a spin 
generalization of the Schrodinger amplitude. It is just this additional factor 
that produces the essential difference between nonrelativistic quantum 
mechanics and the relativistic theory. It is this factor which forces the 
segments of paths to be null, and which causes most of the difficulty in 
extending the path-integral method to the relativistic theory. 

Must we simply accept the t8(q 2 ) factor in the infinitesimal propagator, 
or can we understand how it arises in physical terms? Referring back to 
the Schrodinger amplitude on the lefthand side of (31), we may note that 
if we let m —oo, all paths within the light cone of point 1 have infinite 
action. Because of phase cancelation, such paths make no contribution to 
the path integral determining the propagator. There would be no timelike 
classical path determined by 8S = 0 in such a case. The only way to obtain 
a nonzero propagator would be for all elements of paths contributing to 
the propagator to be null, just as is in fact required by the f8(q 2 ) factor. 
Next, we would need to understand the physical meaning of m oo. 
However, we shall refer back to this question in Chapter 9, and turn for 
the moment to answering a more accessible question. 

In §8.1 we found the property of “mass” to show itself in quantum 
mechanics as a scattering effect analogous to that of the electromagnetic 
field. The analogy is shown again by the fact that the mass appears in (29) 
as a multiplying factor in the phase of an exponential—the electromagnetic 
interaction also appears as an exponential, as we saw in earlier chapters. 
These are indications that “mass” is to be interpreted as an interaction of 
a particle with other particles. This was the point of view we took in the 
classical theory. It is interesting to find the same situation emerging now 
in quantum mechanics. 

The mass interaction on an element da 1 ot a path of particle a is thus 
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\m d(f , where d(f = y t da\ This is the total interaction of particle a with 
all particles in the universe. The elementary interaction with particle b 
would be 


(34) 


— d<f G(A, B) d$, 


using the same propagator G(A , B) as in the classical theory. (Here we may 
recall that in Chapter 4 the electromagnetic interaction was found to be 
the same for quantum electrodynamics as for classical electrodynamics.) The 
interaction (34) relates the element d$ of a path of particle a to an element 
dty of a path of particle b , the points A, B , being at d^ dty, respectively. 
The symmetrization of the interaction with respect to the particle pair a , 
b , has thus led to a 4 X 4 matrix appearing both at A and at B. It is important 
to notice that the y matrices in d$ — y i da\ dty — y i db\ are not multiplied. 
The form (34) is to be interpreted in the sense of an outer product. 

At first sight it might seem as if the mass of particle a resulting from 
summing (34) with respect to all elements of all paths of all particles b would 
then be a 4 x 4 matrix. However, when the transition element of (34) is 
evaluated with respect to the wave function \p b of particle b , it is found that 
the outcome is indeed a scalar contribution to the mass of a . The relevant 
path integral is worked out in Appendix 5 (p. 256), and leads to 



(35) 


The mass of a is therefore given by 



(36) 


b 


the integrals in (35), (36), being taken over all spacetime. This compares 
with 



(37) 


b 


in the classical theory. In place of the unique classical path of particle b 
in the line integral of (37), we now have a distribution of paths represented 
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by the wave function i p b , which gives rise to a scalar contribution 


X Pb( B )M B )^ T B 


from the tour-dimensional volume element dr B . 

When the behavior ot particle b can be discussed within the framework 
of the nonrelativistic approximation, the scalar ^ b \P b is not much different 

from the time component of i.e., the probability density. To see this, 

use the Dirac representation 



(38) 


Writing 



(39) 


\p 2 is then less than \p 1 by the order of the velocity of the particle, and 


Mb = My^b — 


(40) 


which is also the large term in xp b y 4 \p b . Since the nonrelativistic approximation 
can be used for most of the particles in the universe, the dominant contri¬ 


bution to the mass of a particle is therefore to be calculated from the 
probability densities of the particles. Instead of a unique classical path 
weighted by proper time increments db, we now have a four-dimensional 
distribution weighted essentially by the probability density. 


Conceptually, we are not required to change our earlier classical picture 


of the origin of mass in any very major way. But what of the theory of 
gravitation? What happens if we attempt to generalize our ideas from 
Minkowski space to a Riemannian metric? These questions will be con¬ 
sidered in the rest of this chapter. 


§8.3. DIRAC PARTICLE IN A GRAVITATIONAL FIELD 

(LECTURE 20) 


We write the Dirac equation in the form 


y l \p ;i + im\p = 0, 


( 41 ) 
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taking the representation for y l such that the spinors making up \p are 
separated: 


In Minkowski space 



with 





being the three Pauli matrices, and o 4 a n the unit 2x2 matrix. The spinor 

* • 9 9 

suffix a = 1,2, refers here to rows, and ft = 1,2, refers to columns, whereas 
• • 

in o iafi we take ft to refer to rows and a to refer to columns. Contravariant 
and covariant spinors are related by 


where 


U a - 

- e a Pup, 

11 p — 



9 

U a - 

m m 

“ u /3> 

Vp = 

e a pV a , 

(45) 

f 12 ' 

12 

= e = e X 2 

• • 

= £ 12 

= 1 , 


£ 21 = 

21 

—» P W 1 1 ■ P • • 

- t — ^21 

= £ 21 

= - 1 , 


£ 11 = 

= «22 = £ ii : 

— Too ■ 

imi W 

= o, 


6 11 = 

= e 22 = e 1 ’ 

• • 

22 

= e 

= 0. 

(46) 

easily obtains 




• 


• 

1,2,3; 

(47) 



The mixed spinor e a n is given by 




( 49 ) 
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where 8 a ft is the Kronecker delta. It is not hard to see that 






Still working in Minkowski space, we have 





After a Lorentz transformation, the same \j/ would therefore continue to 
satisfy (41), provided the matrices y % transform like a contravariant vector, 

A 

i.e., provided a la/3 , o l a ^ transform like contravariant vectors. It is well-known 
that a spin transformation. 




can be found such that a combination of a Lorentz transformation applied 
to the index /, of c y a applied to the index a of o l a h, of — c y a applied to 
o ia/3 , and of the complex conjugate transformation applied to /?, has the 
net effect of leaving y l unchanged. From this important property, it follows 
that we can continue to use the same y l in (41) after a Lorentz trans¬ 
formation, provided an appropriate spin transformation is applied to the 
wave function \p. 

It is implicit here that the spin transformation (52) applies not only to 
\p but to because the coefficients defining a Lorentz transformation are 
constants, independent of spacetime position. The coefficients c a y are there¬ 
fore also independent of position, and can be taken outside the derivatives 



Next we consider how the above discussion is changed when a gravita¬ 
tional field is present. At any point we can choose a neighborhood in which 
the local metric can be taken to have the Minkowski form. With a suitable 
choice of coordinates, equations (41) to (53) hold in such a neighborhood. 
Let x l be general coordinates applicable to all spacetime, with metric 


ds 2 = g ik dx { dx k . 


( 54 ) 
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In the neighborhood associated with a point X, make a coordinate trans¬ 
formation x' 1 = x' l (x) such that 

ds 2 — r\ ik dx' 1 dx' k + 0(ds 4 ). (55) 


From what has just been said, equations (41) to (53) can be taken to hold 
with respect to the x fi coordinates in the neighborhood in question. 

Consider now the inverse transformation x l = x^x'). Letting the matrices 
o % transform like a contravariant vector, we get 



with respect to the x j coordinates. Raising and lowering indexes, using g ik , 
g ik , in the usual way, we have 


Also 


Crlllftry • —— p 

6 6iy<r oik 


dx’’ dx' 1 ° y ° 


ft rx l • — Tc® c ft • 

— y c cr- 


V a 7ff 






dx l _ dx k 
dx'* dx' 1 




2 dx^ dx^_ n 
dx'* dx’ 1 1 


2 g 


ik 



Defining y l now by 1 


yi = ( g ^p g, (f)> ( 59 > 

what meaning must we attach to xp ;i in order that (41) may be taken to 
hold throughout the whole ot spacetime? For the derivative of any spinor 
function of position u a , write 



i 




where the coefficients L fi ai of the affine connection transform as a covariant 


'Once again we note that in writingg' a/3 as a 2 x 2 matrix, a refers to rows and ft to columns, 
whereas for g ia/i the situation is reversed, with a specifying columns and ft rows. 
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vector with respect to the index i. Denoting the complex conjugate of L^ ai 

b y L<S «v the equation corresponding to (60) for any conjugate spinor v fl 
is 



Consider a spin transformation u a -v u ' a = c/u^. Let L ,fl ai be the affine 
connection giving u' a . if 



a; i 





Provided L //3 ai satisfies 


c JT 'P 

'"ft ^ ai 


cjLy pi 



we have 



^ a;i ^ ft ; i’ (^4) 

and the conjugate equation gives 

+ 

V 'j3;i = C ft V y;i- < 65 ) 

With this interpretation of spinor derivatives, and taking y l to transform 
like a contravariant vector, equation (41) is covariant with respect to both 
coordinate and spin transformations. 

Using the unimodular property of the spin transformation (52), the choice 



can be shown to satisfy (63). If we proceed to differentiate g l a h, using both 
the Christoffel symbols r and the affine connection L, 



d g 


aft 


+ F l jkg k aft 


dx J 


T ~Y _ 7 G • ff ^ • 

^ ajg yft ^ ftjg acr> 



then it can be shown (see Appendix 6, p. 257) that the choice (66) also 
leads to 
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This is analogous to the situation in Riemannian geometry, in which covari¬ 
ant differentiation is defined in such a way that g ik;l = 0. 

The Dirac equation (41) for a particle in a gravitational field must be 
interpreted with respect to the more general form (59) for the y x matrices, 
and with respect to the differentiations (60), (61). We cannot proceed for 
general coordinate transformations in the same way as for Lorentz trans¬ 
formations. It is not possible to find a spin transformation c a /j (X) which 
on being applied to g l a p restores y x to the form (43) at all points X. To 
do so we would require 



c/c/gV = c 


y c -U -IlL 

<* L P ° Y x fix'! 



x' j = x' j (x) being the coordinate transformation that reduces (59) locally 
to the form (43). It is not hard to show that (69) can be expressed in the 
form 



c 

a X 


st < * X sj k 

°i ° yp- 



The matrix c/ is unimodular, 

e y\ C ot yc p X — e a/3 > (7^) 

containing only six real parameters, whereas for a general coordinate trans¬ 
formation there are 16 independent quantities on the lefthand side of (70). 
Evidently, then, it is not possible to find a spin transformation that satisfies 
(70) generally. For the special case of Lorentz transformations in Minkowski 
space, there are only six independent quantities involved in the coefficients 
dx' k /dx x \ so (70) can be satisfied for that case. 


§8.4. THE GRAVITATIONAL FIELD OF 

A DIRAC PARTICLE 

In the classical theory a particle is considered to follow a unique path. 
Associated with the path is an action 5, which is used in two ways. Writing 
8S = 0 for a variation of the path with respect to a given gravitational field 
leads to equations determining the path. On the other hand, varying the 
geometry, g. ik —>- g ik + 8g ik , with respect to a given coordinate path leads 
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to a nonzero variation of the action, which can be written in the form 




V-g d A X, 


where 


T' % \ mat>W = / 


8 4 {X, A) 

V-gO) 


m(A) 


da 1 da k 
da da 





the notation being the same as in previous chapters. In this section we shall 
follow a similar procedure for the quantum case, not in terms of the path- 
integral method, but from a more empirical point of view, our aim being 
to obtain the quantum form of (73). In the classical case, (73) determines 
the effect of the particle on the gravitational field, in accordance with the 
Einstein equations [or with equations (117) of Chapter 6]. We seek now for 
the corresponding gravitational effect of a particle according to quantum 
mechanics. 

In place of a unique classical path, we have a wave function xp satisfying 
(41). We ask for the form of the invariant action S that would lead to (41), 
starting from the condition 

8S = 0, ip xp + 8xp, (74) 

the metric tensor g ik (X) being considered a given function of position. The 
answer to this question is 


S = f (tyyty-j - rnW)V—g d*x, 



the condition (74) being applied to an arbitrary finite 4-volume V, with S\p 
set zero on the boundary of V. Here xp is given by 

4 , = (v a , u0, (76) 


and 


'pY }x P-j = g la/3 [ U (3 U cc,j + V a U P,)]> 


xPxP 


U„V a + UpV^. 


a 


(77) 

(78) 


In order to verify that (75) does indeed lead to (41), we have to consider 
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the variation 8S resulting from 

u a ^>-u a + Su a , Vp Vp + dVp . 
Consider the particular term 

i 




where 





L p a ,Su fl . 



Integrating by parts, and using 8u a = 0 on the boundary of V, we can write 
(79) in the form 



The integrand of (81) is an invariant under coordinate transformations. At 
any point X choose coordinates such that dg ik /dx ] — 0 for all values of /', 
k, /. Then the term in brackets in (81) can be written at X in the form 


g’ a/3 V-g 






Since, moreover, our choice for L a yj gives 



this being the contravariant spin form of (68), and T l - k = 0 at A, the term 
in parentheses in (82) is simply —L^ kj g ja ^, and (82) is just 

(83) 

Because (83) is of covariant form, the integral (81) is equal to 

->f g’ aP V-gUj3;j Su a d 4 x (84) 

'V 

in any coordinate system. It is now easy to see that the variation of (75) is 
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jojotfi 


mv a ) d 4 x 


4- f V ~g 8itj 3 (ig )a/i u a; j — mv fi ) d 4 x 
+ f SvJig^Vp.j 4- mu a ) d 4 x 

4- J V~g 8Vj8( — ig ja ^ V a;j + mu p )d 4 x 
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(85) 


The first two terms of 8S are complex conjugates, as are the last two terms. 
Hence we require the real part of the following expression to vanish: 



V —~g 8u a ( — igi a @Up. j — mv a ) d 4 x 


4- f V~g $v a (ig ja/3 Vp ; j 4- mu a )d 4 x. (86) 


With 8u a , 8v a , arbitrary, this requires 


ig ja i 3 Up ;i 4 - mv a — 0 , 
ig iaf3 Vp • 4 - mu a — 0 . 


Taking the complex conjugate of the first of these, 


(87) 


7 W1 7 ) — Q 


g U a;j + ltnv 

and raising and lowering indices in the second, 


( 88 ) 


g’ a fJ vP -,j + imU a = 

Equations (88), (89), combine to give (41), as we required. 

A similar argument to that which led from (79) to (84) gives 


(89) 


i I g> a/1 (U/}U a ;i + V a V p-j)V-gd 4 X 


1 . 

2 



i gMiu-pU t - Up.p a + v a v $ ., - v aii Vp)V-gd 4 x. (90) 


Hence (72) gives 
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_1 . 
2 



I g )a/j ( u f3 u a;j ~ lr P;j U a + V a V P;j ~ V or,j V '(l 


v„. s v a )V--g d 4 x 


m 


f (u a v a + UpV 1 *) 8 V — g d 4 x 


\ f Sg ik j / 8g ik T {max)ik V^g d*x, (91) 


our aim being now to calculate 7j mat)i/r It is to be noticed that the varia¬ 
tion of V--g on the lefthand side of (91) makes no contribution, because 

of (41). ' W 

From (56) it follows that g %a & and g % ^ a must be complex conjugates. In 

• • * 

spite of this restriction, a variation g ia P —>- g %a & 4- 8g la P contains 16 inde¬ 
pendent parameters, compared to the ten independent variations contained 
in g ik . This can be seen explicitly by considering the choice 


8 g 


iafi 


1 


2 


g k a>3 8g ik + O a yg %y P + 0 /3 xg mX , 


(92) 


where 6 a y + 8 a y is any infinitesimal unimodular transformation satisfying 
0 a a — 0. The choice (92) meets the requirement that 8g xafi and 8g lfia must 
be complex conjugates. From (58) 

2 8g ik = 8g ia Pg k a j } + 8g k a pg ia P. (93) 

Raising and lowering indices in the second term on the righthand side of 
(93), we have 

8gik _ 3gi/c + gM(0« a + o'Pp) = 8g ik . (94) 

The part of (92) involving the infinitesimal transformation on the spinor 

indices thus has no effect on the metric tensor. Nor has such a spin trans- 

♦ 

formation any effect on S. It follows that if we start with variations 8g ia @ 
of the form (92), components of 9 a /3 being nonzero, it is possible to apply 
an inverse transformation —0 a f3 + 8 (X /3 to the spinor indices in 5. After such 
an inverse transformation, 8g iafi is given by 



and this is without S being changed. 

We shall now calculate the variation 8S resulting from (95), noting that 
the variations 
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U a U a + V ft V ji + $ v /3> 

resulting from the transformation —$« fi + 8 a fi , and its complex conjugate, 

do not change S. In Appendix 7 (p. 259) we show that 8u a .j, Svp.j also do 
not change .S’, Accordingly, we have 


8S 


2 


i C 


Mfi^a; j 


U P;j U a + V ct V P\j 


v a ;j u i))'f-g d ' i x. (96) 


Substituting from (95) for 8g* a P gives 


8S 


1 . 

/ 


4 


J ^g lk gk a/3 ( U j3 U a;j ~ U {3;i U a + V a V (i ; j ~ V a; j V ft ) V — g d 4 X 


1 


2 f S Z’ k 


■^(mat )jk g d -T 


(97) 


from which we have 


4 Tmat),fc = ‘gk al3 ( u ff u a;i ~ u j3;i U a + v a v j}-J ~ V a-, V fi) 


+ tgfHUpKik ~ U i3;k“a + V tt V fi k - - V k Va). (98) 


In terms of \p, \p. 


T 


1 .r~7 


(mat); A: 


4 


i['Py k 'P;j + Hfp;k - 'P-jJk'P - P;k7jH 


(99) 


which is the required form of T imSLi)jk . 

For a free particle of momentum p l in Minkowski space. 


T, 


(mat);7c 


r± 

E 


PiPk 


( 100 ) 


and setting \p*\p = n for a beam of such particles of proper density n , we 
have 


TmatWfc = PjPk • (101) 

In the nonrelativistic case E ~ m , and (101) has the same form as a classical 
fluid of mass density mn. This close analogy with the classical case suggests 
that the gravitational effect of a Dirac particle is to be calculated by taking 
the energy-momentum tensor in the Einstein equation to be of the form 

(99). 








160 


8. PROBLEMS IN THE QUANTIZATION OF INERTIA AND GRAVITATION 


The corresponding classical energy-momentum tensor of a particle a is 


$ 4 (X,A) 

V-g(A) 


m(A) 


da j da k 
da da 



( 102 ) 


This differs from (99) in the critical respect that (102) is singular at the world 
line of the particle. If one were to solve the Einstein equations using (102) 
literally, instead of a smoothed approximation, the resulting metric would 
be singular at the particle. This difficulty is avoided by (99), which indeed 
can be regarded as a smoothed form of (102). This smoothing effect of 
quantum mechanics is critical to the concept of a “test particle”—that is 
to say, a particle which is considered to respond to the gravitational field 
without itself having a significant effect on the field. Smoothing on a scale 
much greater than the gravitational radius of the particle, 2 Gm, is needed 
for such a concept to be feasible. Unless the kinetic energy of a quantum 
particle is much larger than m, the wave function ^ smooths (99) on a scale 
greater than m~ 1 . Thus, as an order-of-magnitude statement, we require 


m 1 2 Gm. 


(103) 


This condition is satisfied for actual particles by the very large factor ~10 40 , 
a factor whose significance we shall consider in greater detail in the next 
chapter. 


§8.5. THE GRAVITATIONAL EQUATIONS 

(LECTURE 21) 

The discussion in the last two sections, although giving interesting results, 
is not logically satisfactory, since it associates a sophisticated quantum theory 
of particles with only a classical theory of gravitation. It would be best, of 
course, to lift our consideration of gravitation to a level of sophistication 
like that for particles, but such a program runs into a difficulty which we 
shall discuss in the next section. For the moment, then, we shall content 
ourselves with the opposite procedure, of degrading the theory of particles 
down to a level where it matches the classical theory of gravitation. This 
procedure has two advantages: first, it shows how the phenomenon of 
gravitation arises from a quantum point of view; and, second, it suggests 
an approach to a more refined consideration of the problem. 

The relevant relationship between particles and the classical theory of 
gravitation is contained in the quasiclassical quantum approximation. The 
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nonrelativistic Schrodinger equation for a particle of mass m in a potential 
U(X) at point X(x , t) is 


. dip 


i 


dt 



d 2 ip d 2 \p d 2 ip 


dx 


.2 


+ 


dy 2 

y 


+ 


dz 


2 


~h U\p. 


(104) 


It is always possible to choose real functions A(X), S(X), such that 


ip = A exp(/\S). 


(105) 


Thus writing ip = xp 1 + iip 2f where ip v ip 2 , are real functions of x , t , we take 

tan S — ip 2 /ip v ^ cos S = 'I'v Substituting (105) in (104), and separating real 
and imaginary parts, we get 


dS 

dt 



1 


2 m 


(VS) 2 + u 


1 


a 2 a 


2mA \ dx 2 



(106) 


dA A (d 2 S 
dt 2m \ dx 2 




+ — VS• VA 

m 



(107) 


In the quasiclassical approximation, the function A(X) is considered to be 
slowly varying with respect to the point X , so that the terms in (106) involv¬ 
ing second derivatives of A are small. Neglecting these terms, (106) becomes 
the Hamilton-Jacobi equation of classical mechanics 

— + . T ( v5 ) 2 + u = °- ( ios ) 

dt 2m 


And multiplying (107) by 2 A leads immediately to 


dA 2 

dt 


+ div 




(109) 


To interpret these equations, consider an initial time, / 0 , say. At any point 
x where A 2 is nonzero, define a velocity by 

v = — VS. (110) 

m 

With (110) as initial condition at jc, t () , consider the classical trajectory. At 
any time t such a trajectory defines a velocity which continues to satisfy 
(110), since (110) applies at all points of a classical trajectory—provided of 
course that S satisfies (108). By considering trajectories for all points x, / 0 , 
at which A 2 ^ 0, we obtain a velocity field satisfying (110), and in terms 


























162 


8 PROBLEMS IN THE QUANTIZATION OF INERTIA AND GRAVITAi ION 


of this velocity field 


+ di v(A 2 v) = 0, (HI) 

dt 

which is like the equation of continuity of a fluid. The classical trajectories 
are thus the stream lines of a “probability fluid.” 

All these results hold more generally than for the wave equation (104), 
being applicable to any quantum system with a classical analogue (Dirac, 

1947). 

Denote a particular streamline by a(t ), and for any point x , /, define £(/) 
by 

£(/) = x — a. (112) 


We have 

S(x,t) = S[a(t)] + [£• VS]a tt + .... (H3) 

Here ,S[a(/)] is the classical action evaluated for the path a{t). When higher- 
order terms than the first in £ are neglected, the phase factor exp(/S) is 
thus 


exp iS(x, t) — exp /'S [#(/)] * exp i(£ m VS) 

= exp z'S[a(/)] * ex P iijna *1), (114) 

and 

\p — exp iS[a(t)] * exp i(md A. (115) 

To this approximation, the wave function consists of the probability factor 
A(x, t), withal 2 moving like a classical fluid, an oscillating factor exp i(md • £) 
corresponding to momentum md , and a classical phase S[a(f)], which at time 
t is independent of x. This last factor plays no role in quantum calculations, 
since the wave function is considered at any time t to be arbitrary to within 
a constant phase factor. Yet it is this last factor which plays a crucial role 

of gravitation. 

In order to deal with a many-particle system, we refer to (115) as the 
wave function for particle a , and denote it by \p a . For particle b we have 
correspondingly 


in determining the classical theory 
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where rj(t) = x b — b(t). For particles a, b,... that are well-separated from 
each other, the total wave function is given by the simple product 


a> • • • 5 0 x Pa( X a’> O^Pb^b’ 0 • • • • 


(117) 


Appearing now in \p is the factor 



(118) 


a 


in which S[a(/)], *S[^(/)],. .., are the action values for the classical paths 
a(t ), b(t), .... Because we are working from the nonrelativistic Schrodinger 
equation (104), S [#(/)]... would in the first instance be calculated non- 
relativistically. However, it is not necessary to go to fully relativistic quantum 
mechanics in order to see that a relativistic calculation, 



(119) 


gives a better approximation for the free-particle case than 



( 120 ) 


In the nonrelativistic approximation to 



we have 


T — t 2 — t 1 > \x 2 — jCj| = \X\ say, 


( 122 ) 



(123) 


giving 



In terms of the Dirac representation 



y 4 


(125) 
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we get 


m \ 3/ “ 

~ ) exp (~ims 21 ). (126) 


Using (123), and remembering that in the Dirac representation it is the unity 
2x2 matrix in the upper lefthand corner of (126) that propagates the large 
part of the wave function, we obtain the free-particle propagator of the 
Schrodinger theory, 




(127) 


There is no need, however, to degrade the exp( — ims 21 ) factor in (126) to 
the form given in (127), and exp( — ims 21 ) is just the factor implied by (118), 
(119), for any one of the particles a, b,.... Thus we can regard the factor 
(118) as being computed relativistically with respect to the paths a(t ), 
b(t ) 9 .... 

Our quantum discussion is in Minkowski space, where the classical paths 
of free particles are straight lines, in which case (119) gives the four-dimen¬ 
sional distance between the endpoints of the integral, thereby yielding a 
phase factor in (118) of the same form as exp( — ims 21 ). 

When a gravitational field is present, we transform locally to Minkowski 
space, calculating the phase factor of a particle in the manner described 
above. Transforming back to the original coordinate system does not change 
the wave function—we saw in §8.3 that for general coordinate trans¬ 
formations y j is a contravariant vector, whereas ip is unchanged. Conse¬ 
quently the phase factor exp{/\S[a(0]} is present in a general coordinate 
system. Since the determination of S' according to (119) is invariant, we can 
indeed calculate exp{ /\S[«(/)]} without making the transformation to 
Minkowski space, and the total phase factor (118) for all the particles can 
be similarly calculated. 

Suppose we now admit a small change g ik — g ik + 8g ik . All three 
factors in the wave function (115) of a particle will be altered. Variation 
of A is less important than variation of S because the latter is a phase factor, 
so that for S > 1 only a small fractional change 8S/S is in general needed 
to make a large change in ip. And for particle systems considered over a 
time-interval very large compared to m -1 , the function S is indeed >1. 
Furthermore, the first factor of (115) is much more sensitive to a change 
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of S than the second factor, because |£| is small compared to the integral 
in (119). We see, therefore, that the factor (118) for a many-particle system, 
although playing no role in a purely quantum-mechanical problem, becomes 
the sensitive part of the wave function as soon as we consider a small change 
of geometry. 

To formulate an explicit problem, consider the wave function if on a 
spacelike surface a. The argument of (118) is 


~ 2 / w O) da > 

a 


(128) 


where the line integrals are taken up to a, and where we now consider the 
possibility that the mass may depend on position. Indeed, if we interpret 
m in the same way as in earlier chapters, (128) would become 



the integral with respect to db being unrestricted. It is not hard to see that 
(129) can be written in the form 



and that for variations of geometry at o there is no change in the second 
term of (130). The change of the first term is just the problem discussed 
in Chapter 6. 

We now take the view that to investigate any physical problem in the 
neighborhood of a, we must consider not a unique geometry at a but all 
geometries. Thus the wave function xp considered above is to be taken as 
the amplitude for a particular geometry. We have to add amplitudes for 
all geometries, which involves the summation 


2 ex P 

geometries 



( 131 ) 
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Only geometries close to that for which 


5 2 X // G(A ’ B)dadb — 0, g ik g ik + (132) 

a b 

will be recognized in this summation. Thus a physical problem considered 
at a particular location (neighborhood of a) picks out a geometry g ik deter¬ 
mined by applying (132) at that location. 

We found in Chapter 6 that (132) leads in a certain conformal frame to 
the Einstein equations. We now see that a solution g ik (X) of the Einstein 
equations is not to be interpreted as determining a unique geometry for 
the world; g ik (X) in such a case is the form of metric tensor which dominates 
a physical problem at X, but all other geometries, not satisfying the Einstein 
equations, are to be considered logically possible. 

Returning to (106), what is the condition for the terms in A to be small? 
Let the probability A 2 be distributed approximately uniformly through a 
spatial region of dimensions Then the terms in A are of order mr x L ~ 2 , 
whereas for a nonrelativistic particle dS/dt is of order m and (VS) 2 /2m 
is of order mv 2 . Thus the required condition is L > m _1 |v| _1 . 


§8.6. QUANTUM GRAVITATION (LECTURE 22) 

It is possible to define quantum gravitation in close analogy to quantum 
mechanics. We pass from the unique path of classical theory to quantum 
mechanics by summing amplitudes for all paths. Similarly, we go from the 
unique geometry of classical gravitation to quantum gravitation by summing 
amplitudes for all geometries. 

We could seek to give expression to this idea by generalizing the concept 
of a path integral to include all geometries as well as all paths. However, 
from the outset we encounter the difficulty that such a procedure contains 
redundancies. We cannot vary the ten components of g ik (X) for each point 
X and the paths of the particles independently. To see this, make a small 
change a l (r) — a l (r) + Sa l (r) in the parametric representation of a path 
a. Now choose an infinitesimal coordinate transformation x H = x H (x) such 
that a' l ( t) = a l (r) — 8a l (r) for all values of the parameter r. That is to say, 
in the coordinate frame the world line is represented by the same functions 
of r as before. The coordinate transformation changes g ik (X) to g ik (X) + 
s gik( x )> say. A Change of geometry g ik (X) g ik (X) - Sg ik (X) then 
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leaves g ik the same as before, and we are back to the original situation. 
Of the ten degrees of freedom of the g ik (X) at each point X , six belong 
to the geometry itself, and four give freedom in the labelling of coordinates. 

It follows that if we use a path integral containing a summation with 
respect to g ik (X ), as well as with respect to the paths of particles, we shall 
be obliged to restrict the integral in some way in order that the appropriate 
degrees of freedom do not appear more than once. The alternative is to 
formulate the geometry of spacetime in a way that has only six independent 
functions of position, instead of the ten functions of the metric tensor. This 
implies a departure from the usual way of describing Riemannian geometry. 
Hitherto, the quantization of gravitation has always been discussed within 
the framework of the first alternative. Here we shall follow the second. We 
shall not carry the discussion beyond the objective just stated, however— 
further discussion of quantum gravitation would go beyond the scope of 
this book. Nevertheless, we regard the following discussion as a necessary 
prerequisite to an effective quantum theory of gravitation, particularly for 
the strong-field case. 


A Subgroup of U 4 


By using the quantities g 1 ^, discussed earlier in this chapter, any vector 
A i in Riemannian space can be expressed as a spinor 



(133) 


Given A afi it is easy to recover A { . Thus 



Q ®ft • A 

Si 6 


2 A., 


(134) 


since g i a/1 g k a B = 2 g k . Evidently, then, A a j 3 is an equivalent way of specify¬ 
ing A- 

Physical vectors in Riemannian space can always be expressed in a 
particular spinor form. Writing A n ^ in the column form 



L = 1,2, 3,4, 


(135) 


then A v A 2 , are real, and A 3 , A 4 , are complex conjugates. This follows 
because the g l n i> are Hermitian. 
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Consider transformations of A L through unitary matrices 

Am — UmlAl’ 0 ^ 6 ) 

U ML being the components of a unitary matrix U. We ask for A M ' to have 
the same “reality property” as A L , namely A x \A 2 \ real and A 3 ', A 4 ', complex 
conjugates. It will turn out that the matrices U which preserve this property, 
and which hence correspond to real transformations in Riemannian space, 
form a six-parameter subgroup of the four-dimensional unitary group U r 
Taking, first, the case in which U is infinitesimal, we can write 

U = 1 + ieH, (137) 

where e is infinitesimal and H is Hermitian. The most general form of H 
is 


abed 
[ b e_ f g 

c f h k 

d g k l 

in which a , e , h, l are real. Writing 



with p, q, real, (136), (137), (138), and (139) give 


(138) 


(139) 


Am — A m + is 


ap 

+ 

bq 

“h 

cr 

+ 

dr 

> 

bp 

+ 

eq 

+ 

fr 

+ 

g 7 


cp 

+ 

fq 

+ 

hr 

+ 

kr 

• 

dp 

+ 

gq 

+ 

kr 

+ 

IT 



(140) 


Since /?, q, r, are arbitrary, the requirement that A/, A 2 \ be real and A s \ 
A 4 \ be complex conjugates gives 


a — 0, b imaginary, c — —d, 
e = 0> f=~g , h=-l, k — 0. (141) 

In terms of real a, ft, y, 6, A, jit, we can therefore express H in the form 
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That is to say, 


(142) 


where 



H T T y^3 T ^^4 T ^^5 T (143) 







I 

0 

0 

0 



0 

0 

0 





0 

0 

0 

0 


0 

0 




0 

l 

0 

0 



(144) 


are all Hermitian. 

We show now that all unitary matrices, whether infinitesimal or not, with 
the required property for A M ' are determined by Hermitian matrices of the 
form of (143). To demonstrate this result, we make use of the algebraic 
property that a unitary matrix U gives rise to a Hermitian matrix H through 
the relation 


iH = (1 - U)( 1 + U)~\ (145) 

The meaning of (145) is that iH when multiplied by 1 + U gives 1 — V. 
Expanding, we obtain 

iH = (1 - 10(1 - V + U 2 - U 3 ...) 

= 1 - 2U + 2 U 2 - 2 U 3 ..., (146) 

from which we see that if U applied to a real vector A L yields a real vector 
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A m ' then iH applied to A L also yields a real vector. Hence H must be of 
the form (143). Since (145) can be inverted to give 

U = (1 - iH)( 1 + iH)~\ (147) 


the result that U can be generated by a Hermitian matrix of the form (143) 
is proven. Moreover a matrix H of the form (143) yields a matrix U, by 
(147), with the required reality property. 

It is also not difficult to show that the subset of unitary matrices with 
the reality property in question form a group. If U v U 2 , operating separately 
on A l each yield a real vector, then so does the product U 2 U V The unity 
matrix obviously has the required property. If U has the required property, 
then so has iH given by (145). Now 

U- 1 = (1 + iH)(i - iH r 1 (148) 


can be expanded in powers of iH, from which we see that U~ 1 also has 
the required property. The unitary matrices in question therefore form a 
group, which is clearly a subgroup of U 4 . This subgroup is not normal 
(invariant) and so cannot be used to factor U 4 . 

There is a singular case which must be mentioned. It is necessary in using 
(145) that U shall not have — 1 as an eigenvalue. This case can be covered 
by a limiting argument applied to the parameters a, /3 , y, <5, X, jU, in (143). 


§8.7. A NEW FORMULATION OF 

RIEMANNIAN GEOMETRY 

Let a, /?, y, <5, X, ju, be single-valued, well-behaved functions of four real 
parameters x % \ i = 1, 2, 3, 4. We shall refer to these parameters as “coordi¬ 
nates.” Denote an ordered chain of infinitesimal hermitian spinors, each 

with an undotted and a dotted index, by a {n) a(i \ n = 1,2.Consider the 

following relations: 


n 


X 


in) 


2 Sir) 


a i 3 a 


(r) 


afi 


(148) 


r — 1 


Sin + 1 ) 


ia/3 




(149) 


Here U[x (n) ] is the unitary matrix given by (143), (147), with a, ft, y, 5, X, 
H, taken at the parametric values x\ n) . For a given chain a (n) a j 3 , we can solve 
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equations (148), (149), iteratively, beginning with x (0) 4 = 0, U[x ({)) ] = 1. By 

the spinors being infinitesimal, we mean that each term in the series (148) 
is infinitesimal. 

At each step of such a spinor chain, we thus have four coordinate values 
x\ starting with x 1 = 0 (/ = 1,2, 3,4) at the beginning of the chain. Geo¬ 
metrically, we can think of a spinor chain as representing a path in coordi¬ 
nate space, beginning at the origin of coordinates. The same coordinate 
values x l can be arrived at through different spinor chains. Thus we can 
have 





i 






i 


(150) 


for two distinct chains a {r) a p, a' {r) a p', r — 1,2,..., the identity of coordinate 
values occurring from n , n \ steps, respectively. The coordinate values asso¬ 
ciated with r < n for a {r) a h> and r< n' for a' {r) a h, would in general be 
different in such a case, corresponding to geometrically different paths in 
coordinate space. 

The functions g ia(i are determined by the parameters x l ; i = 1, 2, 3, 4. 
Defining 

2 g* = g iai3 g k aj3 (151) 

gives a Riemannian geometry with g ik as metric tensor. The geometry is 
determined by the six functions a , /?, y, 8 , A, jit, of the four parameters x\ 
Evidently, then, we have attained our objective, which was to specify the 
geometry in terms of six functions of the coordinates. 

To obtain a full quantization of both particles and geometry, we consider 
all paths in coordinate space for the particles (i.e., all spinor chains) and 
all functions a, P , y, 6, A, jit, for the geometry. There are no subsidiary 
conditions in such a formulation. 

It remains to consider how to interpret a transformation of coordinates 
x'% — x fi (x). In place of (148) we must use 







(152) 


and also invert the coordinate relationship to 

x { = x l (;0* 


(153) 
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Although more roundabout, the procedure is essentially the same as before. 
Suppose *'» is known. From (153) obtain x l {n) , and use (149) to determine 
g(n + Then make the next step in the spinor chain to determine x'\ n + x) 
from (152). The spinor chain itself is not affected by the change of coordi¬ 
nates. 

To interpret the above procedure, we note that, since a, ft, y, <5, A, j a, are 
given in terms of x\ we have to return from x fi to x l through (153) in order 
to determine the matrix U. The g ioc(i resulting from (149) must then be 
subject to the usual vector transformation before being used in (153). 


9 

FURTHER CONSIDERATIONS 

OF COSMOLOGY 


§9.1. DIFFICULTIES IN THE PREVIOUS TREATMENT 

(LECTURE 23) 

The discussion of Chapter 6 contained three unsatisfactory features: 

(1) The Friedmann cosmologies are inconsistent with the absorber theory 
of electrodynamics. We shall not discuss this issue here, but we will return 
to it at the end of this chapter. 

(2) When considered in the Minkowski conformal frame, the Friedmann 
cosmologies are seen to result from the time-derivatives of the mass field. 
The discussion at the end of Chapter 7 showed that these time-derivatives 
arise from the broken ends of the paths of particles which occur at the origin 
of the universe, a situation no different in principle from that in the steady- 
state model. It hardly makes logical sense to object to the latter model 
because it requires creation of matter (i.e., broken ends) and at the same 
time to accept the broken ends of the Friedmann models. 

(3 ) The form of the gravitational equations obtained in Chapter 6 follows 
along lines similar to the derivation of the canonical tensor in the electro¬ 
magnetic theory. Thus in (104) of Chapter 6 the equations are symmetric 
in the advanced and retarded masses. By using the mass-response condition 
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(115) of that chapter, we were able to convert (104) to the form (117) 
involving only retarded masses. Then we were able to choose a particular 
conformal frame in which the retarded mass was constant, and in this frame 
(117) reduced to the Einstein equations. 


We come now to an important point, glossed over in the previous discus¬ 
sion of the Friedmann cosmologies, namely, that these models do not satisfy 
the response condition (115) of Chapter 6. It follows that, in addition to 
the unsatisfactory points mentioned above, there is an actual inconsistency 
in the treatment of these models. To obtain the lefthand side of (115) for 
the Minkowski frame, we use 


G (M (X, B) = 




+ x 




Working in terms of the Einstein-de Sitter model for simplicity, with a 
uniform density L 3 of particles, we can fairly easily see that 


M (adv) r> T (X) = 




which is divergent. On the other hand, the terms on the righthand side of 
(115) are easily seen to be convergent. 


§9.2. THE OTHER HALF OF THE UNIVERSE 


We shall now show that both the second and the third of the above points 
can be resolved by dispensing with an origin of the universe. For Minkowski 
space, there is no reason why the universe should not exist in the lower 

half of the plane as well as in the upper half. The conformal transformation 
function Q from Minkowski space 


ds 2 = dr 2 — dr 2 — r 2 (d0 2 + sin-# dtp 2 ) 


(3) 


to Robertson-Walker space 


ds 


*2 


dt 2 - Q 2 (t) 


dr 2 


1 — kr 


- + r 2 (d0* + sin 2 # d<p 2 ) 


k = 0,ztl, (4) 


is singular at r = 0. This prevents r < 0 in Minkowski space from having 
an analogue in Robertson-Walker space, the implication being that, by our 
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working in Robertson-Walker space, half the physical universe becomes 
suppressed. 

This situation recalls the old horizon difficulty of de Sitter’s first cosmo¬ 
logical model. Instead of working with k = 0, Q = exp(///), in (4), de Sitter 
obtained his model in the form 



H 2 R 2 ) dT 2 - 



R 2 (d0 2 + sin 2 0 dy 2 \ 



where H 2 = \A, A being the cosmical constant. The coordinate trans¬ 
formation relating (5) to the Robertson-Walker form of the line element, 

ds* 2 = dt 2 — exp(2 Ht)[dr 2 + r 2 (d0 2 + sin 2 0 d<p 2 )], (6) 

is 


t = T -h 7-77 ln(l — H 2 R 2 ), R — r exp(Ht). ( 7 ) 

1H 

The form (5) is limited in the R , T, plane to the strip R < 7 / -1 , which maps 
into the region of the r, t , plane confined between the curve 

r — H~ l exp ( — Ht) ( 8 ) 

and the /-axis. Yet we do not restrict the r, /, coordinates to such a region 
when the Robertson-Walker line element ( 6 ) is used. The r, /, coordinates 
are then permitted to take any values in the half plane — oo</<oo,0<r. 
We pass to the R , T, coordinates back through the transformation (7), 
noticing that the transformation is singular on the curve ( 8 ), and taking the 
view that the restriction on R is only a mathematical construct. Such coordi¬ 
nate singularities are well-known in general relativity, and have no physical 
significance. 

In a similar way we now take the view that the singularity at / = 0 in 
the Robertson-Walker space is only a mathematical construct, due to an 
unfortunate choice of the conformal frame. There is a second half to the 
universe, which appears without difficulty when the Minkowski frame is 
used. 

The hyperplane r = 0 must still have exceptional significance. To see 
what this might be, consider the interaction 

f f G(A, B) da db (9) 

between particles a and b. This interaction formed the basis of the gravita- 
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tional theory of Chapter VI. It is irrelevant whether we consider the propa¬ 
gator G(A, B) to couple positively or negatively to da at point A, so long 
as we adopt the same sign for the coupling to db at point B. The sign of 
the coupling can be regarded as being determined by the sign of the constant 
A in the action (70) of Chapter VI. Since the action depends quadratically 
on A, a switch of sign evidently has no effect on the action itself. The signs 
of the mass fields (80) and (81) of Chapter VI are changed, but since the 
mass fields appear quadratically in the gravitational equations this change 
also has no effect. And the 7J mat) ife term (85) of Chapter VI is unchanged, 
since m(A) in this expression again involves A twice, once in the mass field 
and once in the definition of m(A) in terms of the mass field, as in (88) 
of Chapter VI. 

The situation would be different if the sign of A were switched at A but 
not at B. The sign of the action would then be changed, and this change 
would affect the behavior of particle a in an electromagnetic field. However, 
such a procedure would be unsymmetric with respect to particles a and b . 
It is possible to maintain symmetry, however, and yet to make use of the 
idea of sign change in the mass interaction, by introducing the following 
rule: 


The sign of the coupling is the same as the sign of t. 

For r 4 > 0, r B > 0, we have the situation we have considered hitherto. For 
r A < 0, t b < 0, we have the equivalent situation with negative coupling at 
both A and B. For t 4 > 0, r B < 0, however, we have positive coupling at 
A and negative coupling at B —and vice versa if r A < 0, r B > 0. This new 
rule is illustrated in Figure 9.1. 

On this basis, consider the mass field of particle a , 

m {a \X) = f G(X, A)\(A) da, (10) 

in which A (A) = ±e according as t a ^ 0, c being a positive constant. Work¬ 
ing in the Minkowski frame for the Einstein-de Sitter model, let A 0 be the 
point at r = 0 on the path of particle a . Draw the light cone of A 0 as in 
Figure 9.2. In the forward half of this cone, the mass field (10) is c/47 tt, 
where r is the three-dimensional spatial distance of point X from the particle. 
In the backward half of the cone, the field is — c/4t tt, and outside the cone 
the field is zero. For r x > 0, it follows that the only particles contributing 
to the total mass field M(X) = X a m (a \X) are those with points A 0 lying 
within the backward half of the light cone at X. For r x < 0, M{X) is similarly 



a b 



FIGURE 9.1 

There are four propagators, G(A,B), G(A,B), G(A,B), G(A,B), relating the points A, A, 
on the path of particle a to the points B, B, on the path of particle b. With t A > 0, 
t B > 0, t A <0, t B < 0, the propagators couple to the particles with positive sign at A, 
B, and with negative sign at A, B. 



FIGURE 9.2 

The mass field due to particle a at point X in the forward half of 
the cone from A 0 is e/(4nr). At point X' in the backward half of 
the cone, the mass field is —e/iAnr'). 
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determined by the particles having points A 0 lying within the forward half 

of the light cone at X. 

The above rule therefore leads to 


M(X) = 




where L~ 3 is the particle density in the Minkowski frame for the Einstein-de 
Sitter model. Thus for the inertial mass of a particle we get 

m(X) = X (X)M(X), (12) 

and provided we keep to our rule for A, the sign of m is always positive, 

m(X) = f ( 2 L~ 3 t x 2 , (13) 


whatever the sign of t x . Except for the trivial change of writing e rather 
than X 2 for the coupling constant in (13), this is the same as the inertial 
mass used in Chapter 7. 


§9.3. PARTICLES OF NEGATIVE MASS 


Why can we not have particles with the sign of X inverted, X = — € for 
r > 0, or X = +€ for t < 0? Such particles would have negative mass 
— m{X ), m(X) still being given by (13). It hardly seems satisfactory to ex¬ 
clude this possibility by a purely axiomatic appeal to the above rule, since 
otherwise the geometrical surface r = 0 would be represented as the physi¬ 
cal cause of the switch in sign of X. Rather must we argue the opposite 
way—that the switch in X determines the surface r = 0. 

Two particles, a of positive mass and b of negative mass, have an energy- 
momentum tensor 


8 4 (X,A) 

V-gO) 


da 1 da k 
m(A) - 

da da 



t\(X, B) 


m(B) 


db l db k 
- db. 

db db 



Such a pair would be capable of annihilation without release of energy. 
Particles of negative mass are therefore not antiparticles, since particles and 
antiparticles always annihilate with nonzero energy release. A general 
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assembly of particles, some of positive mass, others of negative mass, would 
tend to evolve until only particles of one kind remained, namely, the kind 
that happened to be in the majority in the first place. Here we have a reason 
why the sign of the coupling constant may be expected to be the same 
throughout substantial regions of the universe. Different regions could have 
different signs, regions with A = +€ giving rise to positive contributions to 
the total mass field M(X), and regions with A = — e giving rise to negative 
contributions. The case considered above, A = +€ for r > 0, A = — e for 
r < 0, is a simple example of this, with the universe separated into only 
two regions. In principle, however, the universe may be very much more 
complex than such a simple model. 

The general implications of a multiregion universe will be considered in 
§9.5. For now, we return to the question formulated above. Taking our 
region of the universe to have A = +c, what would be the properties of 
a particle with A = — e, assuming such a particle to exist temporarily in our 
region? Instead of being propagated quantum-mechanically by K + , i.e., 
with positive-energy states going forward in time and negative-energy states 
going backward, such a particle would reveal the opposite situation, with 
negative-energy states going forward and positive-energy states going back¬ 
ward. The appropriate propagator would be 




K_( 2; 1) 


2 “nGKO), 

E_ < 0 




2 U nQ) U n 0 )> C C* 


E n >0 


In terms of Hankel functions, 


KJ2; 1) = (f 2 - ini) 
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477 


S(s 2 


21 


) 
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Stts 


H, (2 \ms 21 ) 
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K_(2; l) 


(f 2 - im) 


1 


477 


S(s 2 9a ) 


m 


H 1 il \ms 21 ) 


Stts 


21 



(16) 

(17) 


The additional minus sign in K_ is related to the time sense. Inversion, 
T —>- — t, transfers the minus sign to K + . Otherwise the difference in 
behavior of a negative-mass particle from one of positive mass is determined 
only by the nature of the Hankel functions. In (124) of Chapter 8, we had 
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H™(ms 2 1 ) 


2 


1/2 


7rms 2 ! 


exp — i\ms 21 


T 



ms 21 1. (18) 


The corresponding asymptotic expression for H 1 il \ms 2l ) is 




/ 2 Y /2 

l 

377 \] 

( ) ex p 

1 1 ms 21 - 

2 /J 

\ 7Tms 21 / 

L \ 


m^ 2 i 1* 


(19) 


The quantity ms 21 in these expressions arises in the free-particle case from 
evaluating f mds. The phase 



determines the classical action for a positive-mass particle, as we saw in §8.5. 
The corresponding phase in (19) 


+ f mds , (20) 

also determines the classical action. The sign change here characterizes a 
particle of negative mass. The contribution to the gravitational field is 
opposite to that of a positive-mass particle. There is also a switch of sign 
in the charge-to-mass ratio, leading to a different orbit in a prescribed 
electromagnetic field. On the other hand, the orbit in a prescribed gravita¬ 
tional field is the same as for a positive-mass particle. 

The appearance of the K_ propagator is interesting. It fills a logical gap 
in the usual theory. Particles are propagated by K + in our region of the 
universe, but there can be other circumstances in which K_ is the appropriate 
propagator. By these considerations the universe takes a more symmetric 
form. 


§9.4. OSCILLATING UNIVERSE AND 

BLACK HOLES 

The Friedmann cosmological model with Jc = +1 when considered in the 
Robertson-Walker conformal frame does not expand indefinitely. It expands 
to a minimum density and then falls back, ultimately reaching a metric 
singularity. As we saw in Chapter 2, the interior solution for an infalling 
spherically symmetric uniform local body has the same property, and indeed 
has an identical mathematical structure. No recourse within the usual 
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gravitational theory has been found for preventing these singularities. Work 
by Penrose (1965) and by Hawking and Penrose (1969) shows that it is 
impossible to prevent singularities from arising in these problems—a result 
which is of serious concern to most cosmologists. This situation is easily 
resolved, however, in terms of the ideas we are discussing here. 

The Friedmann model with k = - hi was discussed in Chapter 7 in relation 
to the Minkowski conformal frame. The expanding phase of the model was 
represented by the expansion to infinity of a finite distribution of material. 
Some subtlety was involved in understanding the connection of the ex¬ 
panding and contracting phases—a connection more easily discussed with 
respect to the Robertson-Walker frame. The Minkowski frame is much better 
suited, however, to the contracting phase, since in the Minkowski frame there 
is no metric singularity. The finite cloud of Chapter 7 contracts back* to 
its initial configuration at r = 0, whereupon it expands again, now with 
decreasing r, to r —— oo. This second expansion is similar to the first 
expansion from r = 0 to t —+oo. As in the previous expansion phase, 
the cloud turns around at r —— oo, contracting back to r = 0, and begin¬ 
ning a new cycle of expansion from r = 0 to t— +oo. Evidently, by 
extending the Minkowski frame to include the lower half of the plane, we 
have arrived at an oscillatory model. This goal has often been sought, but 
has never been attained within the scope of the usual theory. It is not hard 
to understand the reason for this failure. 

So long as the mass field M(X) is nonzero, it is possible to choose a 
conformal frame in which M is constant. With respect to this frame, particle 
masses are constant and the conformally invariant gravitational theory then 
reduces to the Einstein theory. But with some regions of the universe making 
positive contributions to M(X) and other regions making negative contri¬ 
butions, there can be surfaces of zero M. An attempt to choose a frame 
with M constant then leads to artificial metric singularities at these surfaces. 
It is clear that the usual theory, starting with the assumption of constant 
particle masses, must break down at all such surfaces, and will therefore 
fail to make proper connections across them. This failure is why a satisfactory 
development of the k — +1 Friedmann model in terms of repeated oscil¬ 
lations has never been found within the scope of the usual theory, and why 
an oscillatory interpretation appears without difficulty in a conformally 

invariant theory. 

The same considerations apply to the implosion of a local body—the 
so-called black-hole problem. The metric singularity of the usual theory is 


* In the contracting phase we adopt the convention of measuring the time r backward. 
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an artifact of the assumption of constant particle masses. Just as in the 
k = +1 cosmological model, the implosion phase is followed by expan¬ 
sion—the black hole becomes a white hole. However, the white hole cannot 
appear with respect to the same external observer who witnessed the infall 
of the body into a black hole. It is an interesting problem, not investigated 
here, to determine the relation of external observers, one witnessing the black 
hole development and the other witnessing the white hole. It is an attractive 
conjecture, requiring proof, that two such observers must lie on opposite 
sides of a surface of zero mass. 


§9.5. NEW COSMOLOGICAL MODELS (LECTURE 24) 


Like the situation in electrodynamics, where we regard the universe as being 
made up of equal numbers of positive and negative charges, we can think 
in terms of equal numbers of ± couplings of particles to the mass field 
M(X). In electrodynamics we cannot consider an arbitrarily assigned four¬ 
dimensional distribution of the charges. We can certainly assign a distribu¬ 
tion on a three-dimensional spacelike surface, but the charges then give rise 
to an electromagnetic field that imposes conditions on the trajectories of 
the charges—in other words, only certain four-dimensional distributions of 
the charges are consistent with the electrodynamic equations. 

Similarly, we cannot consider an arbitrarily assigned four-dimensional 
distribution of ± mass couplings, because an arbitrary distribution would 
not in general be consistent with the gravitational equations. Just as the 
electrodynamic equations prevent large-scale charge separations from gen¬ 
erating exceedingly large potentials, so we may expect the gravitational 
equations to prevent exceedingly large mass values from being developed. 
However, because of the comparative weakness of gravitation, it is not 
necessary for positive contributions to the mass field to be as closely com¬ 
pensated by negative contributions as + and — charges are compensated 
in electrodynamics. We can think in terms of positive aggregations, and of 
negative aggregations, on a scale that we usually think of as “cosmolog¬ 
ical”—that is, on at least the scale of H~ x . Although the aggregations are 
“local” in relation to the whole universe, they are nevertheless very large 
compared to the separation of galaxies, for example. 

With positive and negative aggregations distributed to give equal average 
densities of particles with + and — couplings, the value of M(X) is as likely 
to be negative as positive, which implies the existence of at least one surface 
with M — 0. Such a surface is conventionally associated in cosmology with 
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the origin of the universe. However, the theory here does not require such 
an origin, nor does it require the particular zero surface indicated by astro¬ 
nomical observations to be unique. There could be many such surfaces. 

Consider a region of spacetime close to a surface M — 0, taking either 
the surface, or the relevant portion of it, to be spacelike. Setting up comoving 
coordinates in the usual way, the line element can be written in the form 

ds 2 = dt 2 + dx* 1 dx v \ jit, v — 1, 2, 3. (21) 

Provided the region is not too large, the sections t = constant can be taken 
to be isotropic and homogeneous, and (21) can be expressed in the Rob- 
ertson-Walker form, whence by Chapter 7 a conformal transformation can 
be found to reduce the line element to the Minkowski form, 

ds * 2 = dr 2 - dr 2 - r\dO 2 + sin 2 0 d<p 2 ). (22) 

Expressing M(r) locally as a power-series expansion in t, we have 

M(r) = 1bt + j XL~ 3 t 2 + ..., (23) 

where L~ 3 is the particle density and B is a constant with the dimensionality 
L~ 2 . The form of the t 2 term in (23) follows from the wave equation for 
M{X\ 


1 C 5 4 (2f, A) 

D X M + -RM =\Y I 4 - _ da, (24) 

6 a V-g(A) 

all particles in the region in question being taken to have the same A. 

Next we ask for the restrictions imposed by the gravitational equations. 
In the general form given in Chapter 6, these are 



— 3(T ik + <P ik ) T gitS^K — 





J_ ret)adv) 




[M ( reD + M (adv) ], 





[M i (ret) M^ (adv) 4- M^ (ret) M> dv) 


- g tk g pq M P {ret) K (Mv) l 




8 4 (X, A) 

V-g(A) 


A (A)M(A) 


da i da k 
da da 
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(28) 
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Because we wish to work here in general terms, we prefer to start with (104) 
of Chapter 6, involving M (ret) , M (adv) , rather than with the Einstein equations, 
which were derived from (104) with the aid of the response condition (115) 
of Chapter 6. Since A/ (ret) , M (adv) , are both solutions of (24), we can write 

M (ret) = a -f bj + -y AX -3 t 2 + . .., (29) 


M (adv) _ _ a + ( B _ + i-AL- 3 r 2 + . . . , (30) 

where a , b, are constants with dimensionalities L _1 , L -2 , respectively. The 
lefthand side of (25) is zero in the Minkowski frame. Moreover, only the 
44 component of T ik is nonzero, since the coordinates are comoving. It 
follows that the coefficient of g ik on the righthand side of (25) must be zero, 

UK - ~ = 0. (31) 

jL * 


Working to the second order in r 2 , we have 


□a: - 



_ [ M (ret) DAf (adv) 4- M (adv) DM (ret) 


^(ret)^(adv)| 


1 

2 


b(B - b). 



the terms in r and t 2 vanishing identically. Hence the condition ( 31 ) requires 
the coefficient of r in A/ (ret) or in M (adv) to be zero. Either we have 


A/ (re ‘> = a + Bt + }r\L~ 3 t 2 + ..., 

^(adv) _ _ a + 1 A£-3 t 2 + . . . > (33) 

or 

M <ret) = a + I \L~ 3 t 2 + ..., 

Jm * 


fyf(a,dv) 


a + Bt + ~X L‘ 3 t 2 + 


♦ ♦ ♦ 


( 34 ) 
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The gravitational equations applied to the homogeneous cosmological 
situation carry us no further than this, since the contracted form of (25) 
leads to (24), which has been used already to arrive at the terms in r 2 . To 
proceed, we must either use the response condition (115) of Chapter 6, in 
which case 


A/ (ret) = M (adv) , (35) 

and 

a = 0, B = 0, (36) 

or we must accept the possibility that a , B , could be nonzero. In the first 
case, we have the Einstein-de Sitter model once again, but now applying 
only sufficiently close to the surface M — 0—there is no requirement here 
for M (ret) and M (adv) to be equal to \XL~ 3 r 2 throughout the whole universe. 
In the second case, we have possibilities for new cosmological models, which 
we shall consider in the rest of this lecture. 

Neglecting terms of higher order than r 2 in the expressions for M (ret) , 
M (adv) , and defining G in the usual way, we have 

87 lG = = ^/(reO^ladv) ' ( 37 ) 

Since (37) is symmetric in A/ (ret) , M (adv) , the forms given in (33) do not contain 
possibilities different from those contained in (34). Using (34), and defining 

fi = BL 2 , v = aL, (38) 

we obtain 

877 G — 6L 2 ( — v 2 -f + yAjitL _ 3 T 3 + -^-A 2 L _ 4 t 4 ^ . (39) 

The righthand side here can be positive or negative according to the values 
of A, jit, v , t. When G and the inertial mass m , given by 

m = \M= — A[A/ <ret) + A/ (adv) ] = j L-\\iiL~h + A 2 ZT 2 t 2 ), (40) 

have the same sign, the particles attract each other, but when G and m have 
opposite signs, the particles repel. Again depending on the values of A, /t, 
v, r, we can have either case. 
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Switching the signs of ja and t leaves (39) and (40) unchanged. Hence 
no possibilities arise for r < 0 that are not present for r > 0. It is sufficient 
therefore to confine attention to r > 0. The model is determined by the 
numerical values of the dimensionless parameters A, ja, v. We shall restrict 
the following discussion to examples of special interest. 

A. v = 0, A ju. 0 

G and m are both positive; so the particles attract each other in this case. 
The character of the model is perhaps best understood by transforming 

conformally to a frame in which the inertial mass becomes independent of 
t. Taking 


Q = + \L~ 2 t 2 ), 


2 m 


T > 0, 


0 


( 41 ) 


we get 


When 


we have 


m* = L2 1 m — m 0 , 


G* = tt 2 G = 


3 A 2 (/jt/A + t/L) 2 


4tt m 0 2 Ifir/XL + r 2 /L 2 


t/L > n/ A, 


(42) 


(43) 


(44) 
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2m q 


L 3 r 2 , 


(45) 


G 


3 A 2 


4 77/77 () 2 


(46) 


The line element is transformed by (45) from the Minkowski frame to 


ds * 2 = 


A 4 


4 / 77 0 2 


L 6 r 4 [^/r 2 — dr 2 — r 2 (d0 2 -f sin 2 ^ ^(p 2 )] 


dt 2 — Q 2 {t)[dr 2 + r 2 (d0 2 -f- sin 2 ^ dtp 2 )]. 


(47) 
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Q(t) 
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(48) 
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The forms (46), (47), (48), are the same as those obtained for the Einstein-de 
Sitter model in Chapter 7, except for an interchange of starred and unstarred 
quantities due to our working here from the Minkowski frame instead of 
from the Robertson-Walker frame. 

The situation is very different from the Einstein-de Sitter model, however, 
when 



In place of the relation (45) to (48), we then have 




3 A/a 
87 t m 0 2 




(50) 
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A 2 /a 2 
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L 4 r 2 [dr 2 — dr 2 — r 2 (d6 2 + sin 2 # d<p 2 )], 
Q 2 (t)[dr 2 + r 2 (d0 2 + sin 2 # d(p 2 )\, 
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where 
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Instead of G* being independent of time, as in (46), we have 
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The definitions of the instantaneous cosmological parameters H , q 0 , 
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compared to 

t = jH- 1 , % = +J’ 



for the Einstein-de Sitter model. 

The mass density in the Robertson-Walker frame is given by 

a = m 0 L~ 3 Q~ 3 . 

Using (46), (48), (58), for t/L > ju/X, we get 

_ 3 H 2 
° ~ 8 ttG*’ 

whereas using (54), (55), (57), for t/L < \x/\ gives 




3 H 2 
4ttG* ' 





Summarizing, near enough to the surface ot zero mass we have a model 
with variable gravity, of the kind first proposed by Dirac (1937, 1938b). Such 
models have interesting astrophysical and geophysical properties. They have 
been considered by Jordan (1949), by Dicke (1964, p. 142), and recently 
by Hoyle and Narlikar (1972). However, as r increases, there is an eventual 
transition to the usual Einstein-de Sitter model. 


B. v ^ 0, A jit > 0 

Start with the previous case, and let v increase slightly from zero. The only 
change will be for r very close to zero, where 



With G negative and with the inertial mass (40) remaining positive, the 
particles repel each other. Indeed, since the denominator of (39) must have 
a zero at a finite positive value of r, very large repulsive forces arise close 
to the surface of zero mass. Such forces evaporate any small-scale conden¬ 
sations that might be present at r = 0, and hence serve to produce a homo¬ 
geneous situation. 

There are observational indications, particularly from the well-known 
microwave background radiation, that the universe must have been exceed- 






§9 6. IMPORTANCE OF THE CLASSIC TESTS OF GENERAL RELATIVITY 


189 


ingly homogeneous close to the surface of zero mass. It is not easy to 
understand within the compass of the usual theory why this should be so. 
Hence this case of small v has considerable interest in providing a possible 
resolution of this problem. 

More complex properties arise for larger values of v , but these will not 
be considered here. Enough has been said to show that a wide range of 
new cosmological situations, not found in the usual theory, can arise if we 
are permitted to make appropriate choices for the dimensionless parameters 
A, jit, v. This possibility for obtaining new models turns on whether the 
response condition (115) of Chapter 6 must be applied or not. This condition 
leads to A/ (ret) = M (adv) and thence to \x — 0, v = 0, thereby removing the 
new class of models. From an astrophysical point of view, it is desirable 
to find new cosmological possibilities. On the other hand, the response 
condition (115) of Chapter 7, with its analogy to the electromagnetic response 
condition, has deeper theoretical interest. How are we to decide between 
these alternatives? In §9.6 we shall consider the observational implication 
of changing gravity, making explicit reference to the case discussed above. 
The issue to be decided is whether in our particular region of spacetime 
the cosmological line element could be determined by (52), (54), with G* 
given by (55). We have to ask whether this possibility is consistent with all 
available data. 

§9.6. IMPORTANCE OF THE CLASSIC TESTS 

OF GENERAL RELATIVITY (LECTURE 25) 

We turn immediately to the issue just raised, our aim being to discuss here 
the possibility that the locality observed by the astronomer corresponds to 
the case v — 0, \\i > 0, r < \xL/X. The relevant equations in the Minkowski 
frame were 
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A/'reO ~ 1 AL“ 3 T 2 , M <adv) ~ uL ~ 2 T. 

2 


The conformal transformation 





gave 



dt 2 - -^4[c//- 2 + r 2 (dd 2 + sin 2 6d<p 2 )], 
m 0 L z 



where 


t 





together with 





For observations of galaxies, the line element (68) gives H~ l = 2t , 
q 0 = -hi, which agree well with recent work by Sandage (1972a, b). The 
determination H~ l ~ 20 x 10 9 years in this work gives t ~ 10 X 10 9 years, 
which is in satisfactory agreement with independent estimates for the age 
of our own galaxy. Thus there seems to be no conflict between the above 
scheme and the results of "observational cosmology.’' 

The variation of G* with t , according to (70), has many astrophysical and 
geophysical consequences, which also appear to be in satisfactory qualitative 
agreement with a wide range of data—so much so that it would not be 
unreasonable to conclude that the data give quite firm support to (70). 
Indeed an analysis of the situation is likely to leave one in enthusiastic 
support of the above model. It is disconcerting therefore to find the theory 
in conflict with the classic tests of general relativity, with the bending of 
light by the Sun, and with the rotation of the perihelion of Mercury. We 
shall be concerned with discussing these local solar-system problems. 

The difficulty does not arise directly from the time variability of G*, which 
can be taken in these particular problems to remain constant during the 
time intervals in question—only a few minutes for the light problem and 
—10 2 years for the perihelion motion. The factor Xjit/(m () L 2 ) in (68) can 
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also be taken to be constant, and ( Xpt/m 0 L 2 ) 1/ 2 can be absorbed as a scale 
factor into the r coordinate. We regard the Sun as giving rise to a central 
field essentially static with respect to the large-scale features of the observed 

system of galaxies. Taking spherical polar coordinates with r — 0 at the solar 
center, we can write 

ds* 2 = e f dt 2 — e 9 dr 2 — r 2 (d6 2 + sin 2 # dtp 2 ) (71) 

for the local metric, where f g , are functions of r only. We shall denote 
their derivatives with respect to r by/', g'. By choosing a suitable conformal 
frame, it is possible, as we shall show first, to express e f , e° , exterior to the 
Sun in the forms 

e ' =1 -7 + ^(7/ ^=>+7 + ^(7/ (72) 


where S is a constant with the dimensionality of length. This result would 
seem at first sight to lead in the post-Newtonian approximation to the same 
behavior as the Einstein theory—and hence to a satisfactory resolution of 
the present issue. We shall see why this is not so shortly. 

Take the conformal frame to be such that K = \ A/ (ret) M (adv) [ s independent 
of r. Expanding A/ (ret) , M (adv) , in the exterior in powers of 1 /r, we then have 

M (ret) = A/ 0 <ret) (1 +J + ...J, A-/ <adv) = M 0 (adv) (1 (73) 


where A is a constant with the dimensionality of length and M 0 (ret) , M 0 (adv) 
are the present-day cosmological values at large r. If (65) holds, 


A/ 0 (ret) 

M 0 (adv) 




whereas M 0 (adv) /T/ () (ret) is equal to the righthand side of (74) if (66) holds. 
It is the circumstance that the righthand side oj (74) is not unity in the present 
model that leads to difficulty with the classic tests of relativity. When, on the 
other hand, we use the response condition (115) of Chapter 6, we get 
M 0 (ret) = M 0 (adv) and the difficulty is resolved. 

With K constant, the gravitational equations exterior to the Sun take the 

form 
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terms of cosmological order being neglected. To the same approximation, 
we neglect terms on the righthand side of (75) involving time-derivatives. 
Working to order 1 /r 4 , we have 
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J 2 12 o 
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(76) 


3 <P 


44 


3 A/(ret) 

— A 2 e f ~ 9 0 M (adv) 

2 t ' 


-4 


0 


(77) 


Using known forms for R ik — \g ik R, after canceling ^A/ 0 (ret) M 0 (adv) , we have 


R 


li 


1 

2 


gn R 


/' 1 


9 


3 A 2 
71 5 


(78) 


~f+9 


R 


1 


44 


2 


£ 44 ^ 




3/t 


r 2 


(79) 


From (78), (79), 


/' + £' = ^r. (80) 

which gives 

'x 42 

f+8 =-- r U («i) 

since the asymptotic form of (71) at large r is the Minkowski line element. 
It follows that if we write 



then e° has the form 



Next we show that 5 = 0. Multiplying (78) by — re ! , we can easily see that 




+ O 



Substituting (82), (83), for e f , e\ the lefthand side of (84) is 


( 84 ) 
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Hence (84) gives B = 0 and (72) is established. 

If planets were to follow geodesic orbits in the conformal frame in ques¬ 
tion, all would be well: the situation in the post-Newtonian approximation 
would be the same as in Einstein’s theory. However, the equations of motion 
of a particle, say, a , are 


d_ 

da 



+ m(A)f 


kl 


da k da 1 
da da 




The orbit is not geodesic because m is not constant. 

Since this issue is critically important, it is worth interrupting our discus¬ 
sion to prove (86). The theory being conformally invariant, it is sufficient 
to obtain (86) in a particular frame, for example, the frame used above with 
K constant. Since ( R ik — lg ik R). k = 0, we have 

2g kl T ik;l = g* z [M/ ret) A4 (adv) + M* (ret) M/ adv) - gikg^M^M^lj 

_ A/.(ret) QA /(adv) + M/ adv) DA/ (ret) + g kl M k {rei) M i ./ adv) 


+ g fcI M fc (adv) A/ i; / ret) - g pq M p . . (ret) M g (adv) - g pq M p iTet) M q . > dv) . (87) 

Because A/ (ret) , A/ (adv) , are scalars it is possible to invert indices, M i; / adv) = 
M x . / adv) for example, so that the last four terms on the righthand side of 
(87) cancel in pairs. Hence 

g kl T ik;l = j M> et) DM (adv) + ~ M i (adv) DM ret 


1 

2 


M 


(ret) 


6 


RM ,mM + y 


a J V~i0*) 


A vV 




1 

H- M i 

2 


(adv) 


1 

6 


RM (rel) + Y f 8 ^ X ' A ) .. \(A)da\, (88 

T J J 


where we have used the wave equations satisfied by M {ret \ A/ (adv) . Remem¬ 
bering that K = iM (ret) A/ (adv) is being taken constant, and that 




1A [ A/ <ret) + A/ (adv) ], 


(89) 
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1 94 


it is easy to see that (88) leads to 





S 4 (X, A) 

V-g(A) 




An analysis of T lk . k , similar to that given at the end of Chapter 6, leads 
to 




W A) 

V-g(A) 




da 1 da m 
da da 




Thus for (90) to hold at all points X , we require (86) to hold individually 
for every particle. 

The next step is to determine m(X). Writing 


M (ret) _ M 0 iret) u(r ), A/ (adv) = A/ 0 (adv) u(r), (92) 


we have 


1 

□w H- Ru 

6 


M 


— v f 

(ret) J J 


0 


a 


S 4 (X,A) 

V-g(A) 


\(A) da , 


1 

□u H- Rv 

6 


M 


— S f 

(adv) ■ <: — J J 


0 


a 


S 4 (X, A ) 

V-g(A) 


\(A) da , 


(93) 


(94) 


the summations here being essentially confined to the particles of the Sun. 
In the exterior we have the expansions 



Working for the moment to order 1 /r, the sources of Dv must be opposite 
to those of H\u. By adding (93), (94), and putting u = 1, v = 1, in the terms 
involving R, we get 


R = 3 


whence 


1 


M 0 (ret) 


+ 


1 


A/ 0 (adv) 


j _S ^X,A) 


(96) 


□w 


1 

2 


1 


1 


M 0 (ret) M 0 (adv) 


2/ 


\ T /<T 




(97) 
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Uv = 


1 

2 


1 


1 


L JV/ o <adv) A/ 0 <ret) J 


V f S * (X ’ A) A (A) da. 


a 


V-g(A) 


(98) 


To obtain the constant A, it is sufficient to solve (97) in the Euclidean 


• • 


approximation, giving 


A 


NX 

877 


1 


1 


A/ 0 (re0 


A/ (l <adv) 



(99) 


where N is the number of particles in the Sun. Remembering that 


877 C 


3 


6 


K M 0 (ret) M 0 (adv) ’ 


( 100 ) 


we get 


A 


X 

6 


NG*[M 0 


(adv) 


M 0 (ret) 


]• 


( 101 ) 


Putting 


m 


0 


]_ 

2 


A[M 0 (ret) + M 0 (adv) ], 


( 102 ) 


(101) can be written in the form 


A 


j G*9l X , 


(103) 


where 


X 


M 0 (adv) - A/ 0 (ret) 
A/ 0 (adv) -f M (ret) 


AJl — Nm 


0 


0 


(104) 


Neglecting small terms, C M is the inertial mass of the Sun. 

From the 44 component of the gravitational equations, we have 


g' 1 
— + - 


9 


~~~K ef ^^44 + ^44) 


3 


K 


T 


44 


87rC*w 0 ^ J 


r SJX.A) 


a 


V-g(A) 


(105) 
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which gives 


S = 2 G*9H. 


(106) 


Hence 


and 


A = %S /6 


_ , ^ 1 D 

U — 1 + —-1- 7 T + • • • 9 

6 r r* 


(107) 


V 


1 


1 X S E 

— -1—7 + • • •, 

6 r 


(108) 


in the exterior. To determine D , E , we note that in the exterior 


□w = Di; 


]_ 

6 


R + O 


(*)• 


(109) 


whence to order r 4 we have 


d l r 2 e (f-g)/2du \ _ d ( 
\ dr) dr \ 


dr 


d l r 2 e (f-g)/2 dv\ 


dr 


( 110 ) 


Substituting for e f , e 9 , from (82) and (83), using (107) and (108) in (110), 
and equating the coefficients of 1 /r in the parentheses, we get 


D - E = -xS 2 * 


6 


(111) 


The conformal requirement that AT be independent of r imposes the condition 
uv — 1, giving 


D + E = j^x 2 S 2 , 


( 112 ) 


so that 


D = l2 XS2 + l2 x2S2 ’ E= ~l2 xS2 + l2 x2S2 ' (H3) 

It will be recalled that our aim in the above work was to consider planetary 
orbits, given by (86), with respect to the line-element (71) with e f and e 9 
of the form (72), S being given by (106). For the line element (71), (72), 
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there would be no difficulty if the inertial mass appearing in ( 86 ) were a 
constant. The orbits would then be geodesic, and there would be no differ¬ 
ence Irom the Einstein theory. But lor x 2 7 ^ 0 the inertial mass m is not 

a constant. The above results allow us to determine the form of m, starting 
with 


m = y A[A/ (ret) + A/ <adv) ]. 

4W 


(114) 


From (92), (107), (108), (113), we have 


A7 <re0 = M 0 (re<) (1 + 


Ixs 

6 r 


+ 


1 xS 2 


12 r 2 


+ 


1 x 2 S 2 


72 


+ 



(115) 


A/ (adv) = A/ 0 (adv) (1 


1 1 X 5 2 1 x 2 S 2 

6 r 12 r 2 + 72 r 2 


+ 


(116) 


Substituting (115), (116), in (114), and using the definition 


m 0 = j A[A/ 0 (ret) + A7 0 <adv) ], 


(117) 


we obtain, after an easy reduction, 


1 X 2 S 


m = m 0 1 1 


5 X 2 ^ 


6 


72 




(118) 


Next we recall that the model under investigation, with variable gravity 
according to (70), arises because either A/ 0 (ret) > M 0 (adv) , with (65) holding 

for A/ 0 (ret) , A/ 0 (adv) , or M 0 (ret) A/ 0 (adv) with ( 66 ) holding. In either case x 2 — 1, 
as can be seen from (104), so that 



m 


o 




(119) 


It will be shown below that (119) gives § of the Einstein value for the bending 
of light, and § of the Einstein value for the rotation of the perihelion of 
Mercury. Only if x 2 = 0, which requires M 0 (ret) = A/ 0 (adv) do we get the 
Einstein values. In §9.5 we found that the variable-gravity model evolves 
with increasing cosmic time into the Einstein-de Sitter model. In the latter 
we do have M 0 (ret) = Af 0 (adv) , and hence the Einstein values. During the 
transition from one model to another, x 2 changes from unity to zero. 
Correspondingly, there is a change for the bending of light and for the 
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perihelion rotation from values appreciably less than in the Einstein theory 

to the same values as in the Einstein theory. 

We turn now to the effect of (119) on the orbital equations (86). By writing 


m(A) 


d_ 

da 




( 120 ) 


we can express (86) in the form 

d 2 a i pi da k da 1 _ mm 1 
da 2 kl da da m 0 2 


( 121 ) 


Since m l = 0, / = 2, 3, 4, the equations for these components are the same 
as in Einstein's theory, except that a replaces a. There are similar integrals 
of motion (Eddington, 1930, p. 86), 



( 122 ) 


The line element gives 



the orbit being taken to lie in the plane 6 = tt/2. To within the post- 
Newtonian approximation, we can write e g = e~ f in (123), in which case 
the situation in (123) is as in Einstein’s theory except that —(m/m 0 ) 2 appears 
on the righthand side instead of — 1. Following a treatment similar to that 
given by Eddington, we write u — \/r and use (122) to express (123) in the 
form 


du 

d<$> 



c 


h 


+ Su 



o 

m v 


m 


(124) 


0 


Substituting for m/m 0 from (119), we now get 


du V 
d<}> / 


T u 


2 


c 2 - 1 .2 k 

-b —— U 


ti¬ 


ll 1 


TTj~ 1 + TT ku 3 -f 0(u 4 ), (125) 

2 n £ 2 


where 



k 


3 


(126) 
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Differentiating (125) with respect to <f>, and canceling du/d<p throughout, gives 

d 2 u , _ k k 2 , 9 , 9 ,^ 

d& + U ~h 2 ~W U+ 4 ku + -"- (127) 

From the first term on the righthand side of (127), we see that k takes 

the place that \ S holds in the usual theory. Therefore it is k , not S, that 

is determined by the observed planetary motions. But it is S, not A:, that 

determines the bending of electromagnetic radiation by the Sun. Hence we 
have 


k = —- S’ = observationally determined quantity, (128) 

whereas in the usual theory 

yS = same observationally determined quantity. (129) 

Evidently, S in the present theory is f of the usual S , and hence the bending 
effect is only J of its usual value. 

For the perihelion motion of a planet of eccentricity e , the first-order 
solution of (127) can be written in the form 

u = Y9 [1 + e cos (<t> + w )]- (130) 

n 

Substituting (130) for u in the post-Newtonian terms of (127), we get 
d 2 u , _ k k 3 rl , tJL , 

W + U ~¥~W [ + ecos ^ + 0 ] 

H—.. 4 [1 -h £ cos (<f> + <o)] 2 + .... (131) 

The coefficient of cos (<j> + co) on the righthand side of (131) is 4 ek 3 /h 4 , 
whereas in the Einstein theory the corresponding coefficient is 6 ek 3 /h 4 . Since 
this term gives rise to the perihelion motion, the present theory is seen to 
give a motion equal to § of the Einstein value. 

The possibility that observations of the bending of light might lead to 
a result less than the Einstein value has been considered by Dicke, and Dicke 
and Goldenberg (1967) have also suggested that if the interior of the Sun 
is rotating comparatively rapidly, the relativistic rotation of the perihelion 
of Mercury would need to be less than the Einstein value. However, the 
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data, while somewhat equivocal, appear to be against this point of view 
(Clemence, 1947; Muhleman et al., 1970; Sramek, 1971). Since the above 
deviations from the Einstein theory are more severe than those considered 
by Dicke, we must conclude that existing data make the variable-gravity 
theory based on equations (63) to (70) untenable—at any rate given the 
present value of the cosmic time r. Since the variable-gravity model evolves 
with increasing cosmic time into the Einstein-de Sitter model, for which 
the classical tests hold good, it would be possible to argue that variable 
gravity occurred at earlier epochs than the present, i.e., that transition to 
the Einstein-de Sitter model has by now taken place. But this would be 
a weak position. It is much preferable to argue that observations of the 
bending of light and of the perihelion motion show that A/ (ret) = M (adv) , and 
hence provide strong evidence for the response condition (115) of Chapter 
6, which point of view we shall adopt from here on. 


§9.7. ELECTROMAGNETIC RESPONSE 

OF THE UNIVERSE (LECTURE 26) 


We turn now to the first of the difficulties set out in §9.1. In the early chapters 
the electromagnetic theory was developed in terms of a time-symmetric 
interaction between particles. This theory was shown to lead to results in 
agreement with experiment, for quantum electrodynamics as well as for 
classical theory, provided the universe gave an appropriate form of response. 
However, none of the Friedmann models were satisfactory for meeting the 
required response condition. We ask now whether this situation has been 
changed by the considerations of §9.5. 

It will be useful to begin an answer to this question by illustrating the 
difficulty for the Einstein-de Sitter model, doing so with respect to the 
Minkowski conformal frame instead of to the Robertson-Walker frame as 
in the early chapters. We have 



(132) 


j^f (ret) _ Jifi adv) _ JL \L~3j 2 


2 


(133) 


m 



(134) 


where L 3 is the particle density, which is constant in the Einstein-de Sitter 
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model. Although most of the particles may become locally condensed into 



r, and we write N for the particle density of this residue. 



by 


Ne 2 


(constant)- -v if co > ^, 


(135) 


and by 


Ne 2 


(constant)- if v > co, 

mv 


(136) 


where v is the effective collision frequency. Remembering that co does not 
depend on r in the Minkowski frame, the required response condition is 
that the integral taken with t —co of (135) or (136), whichever is appro¬ 
priate, shall be divergent. Suppose that v does not tend to zero or to infinity 


as t y- yj . T he behaviour of the integral is then determined by 



(137) 


m 


and with m cc T 2 this is convergent, not divergent. Nor would the situation 


be helped by v —0 or by v —>- oo with r, since in either of these cases the 
integral would be made more convergent. 


The difficulty here is due to m —oo as r —>- oo, which is just the behaviour 
we were enjoined in §9.5 to avoid. It was repeatedly emphasized there that 


m oc t 2 applies only sufficiently close to a surface of zero mass. We cannot 
continue to use (133), (134), at large distance from such a surface. Provided 
the positive and negative contributions to the mass field prevent m oo, 
just as the plus and minus charge contributions prevent divergences of the 
electromagnetic potential, the difficulty concerning (137) disappears. 

The above considerations, together with those of §9.6, bear critically on 
the whole problem of response conditions. The situation can be summarized 
in the following terms: 


I. The classic tests of relativity, agreeing with the Einstein values for the 
bending of light and for the rotation of the perihelion of Mercury, 
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provide strong evidence for the existence of a mass-response condition, 
of the form given in (115) of Chapter 6 . 

II. The need for the electromagnetic-response condition to be satisfied 
implies that the universe does not possess the simple structure of a 
Friedmann model. There is strong support tor positive and negative 
contributions to the mass field, and for the view that the region of 
spacetime so far investigated by astronomers is but a small part of a 
far vaster structure. 


§9.8. LENGTH SCALES AND UNITS 

In §6.1 we saw how all of physics, excluding gravitation, can be described 
by: 

( 1 ) three dimensionless coupling constants, which are the fine-structure 
constant and constants for the strong and weak interactions; 

( 2 ) the particle mass ratios; and 

( 3 ) the mass of some chosen reference particle. 

It is (3) which introduces dimensionality into physics. The issue now is 
whether the inclusion of gravitation and cosmology throws further light on 
the previous situation. In particular, since we have developed a theory of 
the nature of mass, can we obtain insight into (3)? We might hope, for 
example, to determine the mass of a reference particle in terms of the 
cosmological structure. 

In the following discussion we shall continue, for simplicity, to use the 
Einstein-de Sitter cosmological model. First we note that the definition 

6A 2 

877 G = -^V, (138) 

with m given by (134), leads to a well-known relation between the cos¬ 
mological mass density a = mL ~ 3 and the Hubble constant H. In the 
Minkowski conformal frame, H and r are related by 

H = 2/t. (139) 


These relations lead easily to 



a 


(140) 
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It follows from this derivation that (140) was contained within the previous 
discussion, and is indeed no more than an expression of the definition of 

G. We see therefore that (140) cannot serve to improve our understanding 
of the issues set out above. 

We cannot hope to determine m from the gravitational theory alone, 
because the gravitational theory does not depend on the numerical value 
ot A 2 , whereas m is so dependent. The value of A 2 becomes important, 
however, when the gravitational theory is combined with electrodynamics. 
Writing (134) in the form 

A 2 = y(L//) 3 -(mH -!), (141) 

we have A 2 expressed as the product of two dimensionless numbers. (LH) 3 
is the reciprocal of the number of particles within distance — /Z -1 , whereas 
mH~ l is the ratio of the scale of the observed portion of the universe to 
the Compton wavelength of our reference particle. Empirically, we have 
the following known values 


(LH) 3 ~ 10- 79 , (142) 

mH- 1 ~ 10 39 . (143) 

The number in (142) arises within the gravitational theory alone, but the 
determination of (143) requires electrodynamics as well as gravitation. The 
number in (143) refers to the electron as reference particle. Other choices 
for the reference particle would not change the fact that the combination 

(mH- 1 ) 2 • (LH) 3 (144) 

is of order unity. Nor would the general order of magnitude of A 2 be 
changed, 

A 2 ~10" 40 . (145) 

How are we to interpret (145)? We could take the point of view that the 
emergence of a dimensionless number as small as 10~ 40 is nothing new. In 
the usual theory, G is introduced as an ad hoc constant (see §6.5) with the 
dimensionality of (mass) -2 , such that when multiplied by the square of a 
typical particle mass, for example, the experimentally determined mass m 
of the electron, it gives Gm 2 ~ 10 -40 . Since in the present theory G is related 
to A 2 by (138), one can seek to argue that our finding A 2 ~ 10~ 40 should 
be no surprise. This point of view hardly seems satisfactory, however, 
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because in the present theory A is the basic constant, not G [the relation 
(138) only defines G in terms of A], and it seems less than plausible to suppose 
that a basic physical coupling constant could be as small as 10 -20 . 

Our point of view is that the emergence of (145) is a clear indication of 
the incompleteness of the theory. Much remains to be discovered, but the 
road towards success probably involves the structure of the particles them¬ 
selves. The relation of modern particle physics to gravitation and cosmology 
is still unexplored territory, but it is in this territory where further insight 
into (145) is likely to be found. Such considerations lie, however, beyond 
the scope of this book. 


10 


SUMMARY 


We have been concerned throughout this book with the thesis that the 
behavior of physical systems is a compound of certain basic laws and of 
an interaction with the universe in the large. The basic laws are more simply 
stated than in the usual field-theory formulation of physics. We have found 
considerable similarity in the responses of the universe to different kinds 
of interaction. In Chapter 4 we formulated the response condition for a 
classical electromagnetic field in terms of a spacetime slab 0 < t < T. 
Writing 


B't < o 4 " B % t > t ( 1 ) 

for the part of the field that could not be attributed to direct-particle 
interactions within the slab, we obtained 

B\ >t (X) = mx\, t - B*(X) adv]o< ( <T (2) 

Explicitly for the righthand side of (2), in (117) of Chapter 4 we had 

[ WUo S < < r = J 2 ^ <W 0)ret, 



(3) 
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[B x {X ) adv ] 0 <t <T ~ 2 2 


(4) 


b 


A m v A i(b) adv , being the fully retarded and the fully advanced potentials 
of particle /?, obtained from the segment of the path of b lying within the 


slab. 


A similar formulation lor mass fields, in (115) of Chapter 6 , gave 


(5) 


Mf y T — M (ret) 


0 < t < T m 


0 < t < T 


In both cases the response from the future is expressed as a difference 
between the retarded and advanced interactions within the slab. 

The quantum form of (2), given in (124) of Chapter 4, was 



where Af b \ A/ (b) , were calculated for paths and conjugate paths, respectively. 
When paths and conjugate paths coincide—as they do in the classical 
case—the form ( 6 ) becomes the same as (2). In (26) of Chapter 8 we had 
a response from the future for the free-particle propagator, 


(7) 


) + (^ret- — ^adv-)]» 



in strikingly similarity to ( 6 ). Since the quantum response ( 6 ) contains 


subtleties not present in the classical response (2), it seems likely that (5) 
requires extension to a more far-reaching form, possibly involving the 
structure of particles. 

It is noteworthy that, although all the results of quantum electrodynamics 
follow from ( 6 ), the appearance of both paths and conjugate paths implies 
that there is no purely amplitude formulation of the theory. Probabilities, 
not amplitudes, are necessarily involved, so that the redundancy present 
in the usual theory does not appear here. 

The influence functionals given in (127) and (129) of Chapter 6 show a 
clear distinction between scattering problems and decay problems. The 
normal procedure in scattering is to add amplitudes for diff erent frequencies 
before taking the square of the modules to give a probability. In decay 
problems, on the other hand, the square of the modulus for each separate 
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frequency is taken first, and the resulting moduli are then added for the 
total probability. This procedure is necessary to give an appropriate time- 
dependence for the decay of a quantum state. It is usually argued that, 
whereas the frequencies for scattering are determined by the 8 + function 
and are therefore systematically phased with respect to each other, the 
frequencies in decay problems arise from randomly phased field oscillators. 
The total amplitude should always be obtained by adding all frequencies, 
but in decay problems cross-products between different frequencies average 
to zero when the square of the modulus is taken. No doubt a consistent 
theory can be formulated in this way, but the issue is treated scantily if 
at all in most quantum texts; so the issue would seem to cause some em¬ 
barrassment. 

The situation is clear-cut in the present theory. The influence functionals 
contain couplings of paths with paths, of conjugate paths with conjugate 
paths, and of paths with conjugate paths. From (127) and (129) of Chapter 
4, it is seen that when a path interacts with a conjugate path it is the same 
frequency that operates on both. There is no case in which a frequency K 
interacting with a path is combined with a different frequency K' acting 
on a conjugate path. This is the decay problem. When paths interact with 
paths, on the other hand, the interaction frequencies have no correlation 
with those that operate in the coupling of conjugate paths with conjugate 
paths. This is the scattering case. 

We began the program of work set out in this book with the initial aim 
of obtaining a quantum generalization of (2). It gradually became clear, 
however, that a commitment to the electrodynamic case must lead to exten¬ 
sions of the action-at-a-distance point of view to all parts of physics. So 
far we have obtained similar results in (5) and (7). But other major issues 
remain, particularly for those parts of physics that involve short-range forces. 
One may wonder how a point of view that has been based on long-range 
forces could come to be applied to parts of physics requiring short-range 
forces. How this may be done has been considered elsewhere (F. Hoyle 
and J. V. Narlikar, II Nuovo Cimento, 7A, 262, 1972). 

In the course of the work, we were led to the view that physical theories 
should be conformally invariant. The development of a conformally invari¬ 
ant theory of gravitation freed us from the need to work always in a con¬ 
formal frame in which particle masses are constants. It is the use of such 
a frame that leads to metric singularities, for example, at the so-called origin 
of the universe or at the implosion of a black hole. With particles acquiring 
their mass by interaction with other particles, we are led to realize that there 
could be surfaces where the particle masses are zero. Whereas the con- 
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formally invariant theory experiences no difficulty with such surfaces, the 
usual theory encounters metric singularities which prevent proper continuity 
across them from being established. The usual theory erects artificial 
boundaries at these surfaces, limiting the universe to only one side of an 
open surface, or to the interior of a closed surface. 

These considerations led us to the view that the universe is probably a 
much larger structure than it is usually supposed to be, a conclusion which 
may well form a base for new cosmological developments. 


APPENDIXES 


To avoid overloading the main text with subsidiary detail, we have relegated 
several investigations to these appendixes. Each appendix is keyed to the 
section to which it is, in effect, a note. However, before we deal with these 
necessary additions, it may be useful to consider the topic of generalized 
functions. In our experience it is fairly common for students to continue 
thinking in terms of conventional analysis while carrying through operations 
with the Dirac delta function. Such operations are then uneasily executed 
—facility depends, of necessity, on suitably broadening the concept of 
“function.” 


APPENDIX 1. GENERALIZED FUNCTIONS 

The Dirac delta function belongs to a class referred to in the literature 
sometimes as generalized functions, sometimes as ideal functions, and 
sometimes as “distributions.” These entities are not functions in the sense 
of elementary mathematics, as will be clear from the following definition 
of the delta function: 



8(x) = 0 for x 0, 


1 . 


( 1 ) 
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Here 5(a) is well-defined for all points on the real axis except a = 0. The 
second part of the definition includes integration through a = 0. For such 
an integration to be applied to an ordinary function, we would need to know 
its value at a = 0. Yet this information is deliberately withheld in (1). In 
going from ordinary to generalized functions, we drop some of the rules 
governing the former while still retaining certain basic operations. Gener¬ 
alized functions can be defined in several apparently different ways, although 
it can be proved that the different approaches are actually equivalent to 
one another. It will be sufficient here to consider two of these approaches, 
which can usefully be illustrated in terms of the delta function. A third, 
more abstract method is given by Courant and Hilbert (1962, II, 774), where 
the equivalence of the three methods is proved. 


1. Consider a finite domain 0 with respect to a real variable a, and let 
the set of test functions <j>(x) be zero outside 0. If all derivatives <f (r r' < r, 
are continuous, but <f (r+1) is not, we say that <p is of class C r . If <f (r) is 
continuous for all r, we say that is of class C 00 . Unless otherwise stated, 
<j> will be taken to be of class C 00 . 

Let J0 be a linear differential operator of order r having coefficients that 
are differentiable at least r times within a closed subdomain 0 of 0. Define 
an adjoint operator £* by 




where if, <f, are test functions at least of class C r chosen to vanish outside 
0. Now let W be any continuous (or piecewise continuous) function in 0. 
We make no statement concerning the differentiability of W in the usual 
analytic sense. Yet we define 



f=£[W] 


to be an ideal function with the 


meaning that 







for any test function <fi(x) at least of class C r chosen to vanish outside 0. 
The righthand side of (4) is well-defined in the usual sense. 

The delta function 5(a) can be defined according to (4) by taking 



d 



W — 1 for x > 0, 


W — 0 for jc < 0, 


(5) 
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with 0 any part of the real axis including x = 0. It is easy to verify that 
for any </> which vanishes on the boundary of 0 and which is differentiable 
at least once. 



= <K0). (6) 

To define we choose <£ to be d 2 /dx 2 , take the same W, and use test 
functions 4> that are differentiable at least twice and that are zero outside 
0. Then we have 


d 2 <b 

j^dx= (7) 

2. Suppose we have a sequence of functions f n (x) continuous in 6. Let 

- 0 , ( 8 ) 

and let the limiting process be uniform with respect to all test functions 
</>(x) with the property that |<J>(;t)|, |<J> (r ) (x)|, r' < r, are bounded by the same 
number M in the closed subdomain 6 of 6. The sequence [f n (x)] is then 
said to be weakly r-convergent in 6. 

We now define a generalized function f(x ), 

f(x) = lim ffx), (9) 




with the meaning that 

f(x) 4>(x) dx = lim f f n (x ) <fi(x) dx. (10) 

n -+co 

The limit on the righthand side of (10) is always well-defined in the usual 
analytic sense in virtue of (8). The righthand side of (9), on the other hand, 
may not exist in the usual sense. 

To illustrate the case of the delta-function, consider 

f n (x) = -y= e~ nV , — N < x < N, (11) 

V 77 

where n is taken large. In this case (8) holds for all test functions <p with 
\4>(x)\ < M throughout 0, i.e., for r = 0, where 6 is any closed interval within 
[ — N,N]. Evidently 
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lim f f n (x) <f>(x) dx = <^>(0) (12) 

n ->oo J& 

provided 6 contains x = 0, from which it follows that 

lim f n (x) = S(x). (13) 

H oo 

Although in the usual analytic sense the functions f n {x) become unbounded 
at x = 0 as n oo, we can define (13) in terms of (10). 

It is to be emphasized that a generalized function does not have meaning 
by itself, but only in relation to 6 , 6 , and to the specification of the test 
functions. For a given sequence [f n {x)\ in order that (8) shall continue to 
hold when 6 is changed, it may be necessary to alter the specification of 
the test functions—extending 0 may require the value of r to be increased, 
for example. Hence the choice of suitable test functions may be related to 
the interval over which a generalized function is defined. 

We illustrate the difference between generalized functions and ordinary 
functions with another example. Let 6 be an interval containing x = 0, and 
let 4>(x) be a test function that can be differentiated twice throughout 6 and 
which is zero outside 0. Relative to such functions and to 0 , we can write 


x5'(x) = — 5(x), (14) 

since both sides of (14) when integrated over 6 against any such <f>(x) give 
— <M0). Suppose, however, that we attempt to divide both sides of (14) by 
x, 



The lefthand side of (15) when integrated over 6 against any c p gives — cp'(O) 
and so is well-defined, but in order that the righthand side be also well- 
defined, it would be necessary for <j>(x)/x to be differentiable throughout 
6 at least once. This need not be the case. For example, take the interval 
[ — 1, 1] for 0 and (1 — x 2 ) n , n > 1, for <|>(v). This satisfies the requirement 
for a test function applicable to (14) or to the lefthand side of (15), but 
not to the righthand side of (15), since <f>(x)/x is not differentiable at jc = 0. 

Generalized functions appear in physics through propagators which satisfy 
wave equations of the second order. In the theory developed in the main 
text, these propagators describe interactions between pairs of particles. 
Characteristically in the direct-particle theory, the propagators are subject 
to integration at each member of a particle pair. From a mathematical point 
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of view, the propagators have no existence in themselves—they exist only 
in respect of test functions and domains of integration determined by the 
particles. Yet in conventional field theory the same generalized functions 
are used in respect of fields which are taken to have an existence independent 
of the particles. From what has just been said, it will be clear that such 
a procedure is mathematically dubious. Evidently, we have a cogent argu¬ 
ment favoring the direct-particle theory against conventional field theory. 
We end this discussion by considering some generalized functions which are 
important in their relation to physical propagators. 

Consider two points x % = 0, x 1 = (.v, t), in Minkowski space, and write 

r 2 = x * x , T 2 = t 2 — r 2 . (16) 

For the generalized function 8(s 2 ), we have 

S(.s 2 ) = 8(t 2 - r 2 ) = 2- l 5 (' - r) + 8(1 + r)] (17) 

2 r 

with respect to test functions of x l having nonsingular first derivatives. The 
well-known Dirac identity 


B x 8(s 2 ) = 4t7<5(a;)S(0 (18) 

can be proved in the following way. First we notice that any spherically 
symmetric solution of U\f = 0 is expressible for r 7 ^ 0 as a linear combination 

— g(t — r) + — h(t -f r\ (19) 

r r 


where g, h, are functions of t — r and t + r, respectively. The righthand 
side of (17) is of this form, so that Q r 8(s 2 ) = 0 except possibly at the point 
r = 0, / = 0. To prove that 8(s 2 ) is a generalized function equal to the 
righthand side of (18), with respect to the test functions described above 
we have to integrate over a closed domain in spacetime that includes x' = 0 . 
Taking the righthand side of (17) to represent S(s 2 ) and considering, first, 

the term 


8(/ - r) 
2 r 




and letting the domain be a 4 -cylinder whose cross section in space is the 
sphere r < R and whose generators are between — 7'<t<7’, T ~> R, we 

get 
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f UJd'x 



l 



R 


0 


d l 

a t 


Amr' 1 dr 


t - T 



-T 


d l 

dr 


4ttR 2 dt 


r-R 



R X' 


8\ T - r) 


o 


r T r X’ 

477r 2 dr + | 

2 r J-t 


8\t - R) 8(t 

-1- 


R) 


2 R 


2 R 


4ttR 2 dt 


2 77 R 8(T - R) + 2ttR 8(T - R) + 2tt = 2tt. 


( 21 ) 


In the second line here we have made use of the fact that f = 0 at t — —T. 
Hence there is no surface integral at t — — T. A similar calculation for the 
second term on the righthand side of (17) also gives 277. Hence, 



the same as the integral of the righthand side of (18), which is therefore 
established. 

By using (18), one can derive the Fourier representation of S(s 2 ) very 
quickly. Write 


d(s 2 ) - j\(k i )exp(-ik- (23) 
where k * x = x l . Since 

8(x)8(t)= I exp( — ik • x) > ( 24 > 

it is easy to see that 

X(^)=-t4. (25) 

k" 

[To prove that the righthand side of (24) is the same generalized function 
as the lefthand side, with respect to a spacetime domain including x l — 0 
and to suitable test functions, integrate each component of the vector k { 
over the interval [ — K,K]. By taking an increasing sequence of values of 
AT, one obtains a sequence of functions which tends in the limit K —>- oo 
to the lefthand side of (24). This is easily seen by using the second of the 
two ways discussed above for defining generalized functions.] 

Care is needed when (25) is substituted in (23) to take the principal part 














APPENDIX 1. GENERALIZED FUNCTIONS 


215 


of the resulting integral. 


5 (5-)= - J 4^ exp(-,*•*) (26) 

The need to take the principal part arises at the poles of 1/A; 2 . Writing 
k l = ( k , co), we have poles in a complex co-plane at 

^ = ±\k\. (27) 

If instead of taking the principal part at each pole we were to take the residue 
at co = +|Ar| only, we would get 8 + (s 2 ). This is often stated in the form 



r 477 

J k 2 + ie 


exp( — ik • x) 


d 4 k 

(2tt) 4 ' 



Other solutions of U x f = 4tt 8(x) 8(t), e.g., 8_(s 2 \ 8(t - r)/r, 8(t + r)/r, can 

be obtained by choosing suitable contours in the co-plane. These are shown 
in Figure A 1.1. 


co-plane 



Contour 

Solution 


Co 

8(S 2 ) 


C + 

S + (S 2 ) 


c_ 

8JS 2 ) 


c 

'“ret 

Full retarded 

solution 

c 

^ adv 

Full retarded 

solution 


FIGURE A1.1 

Solutions of / = 47 t<5(x) 8(f). 
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APPENDIX 2 (a note to §4.2) 


In the nonrelativistic theory the propagator K( 2; 1) is given by the path 
integral, 

for t 9 > t v whereas for t 2 < t 1 we have K(2; 1) = 0. Given the wave function 
ip(x v t x ) on t = t v 

\p(x 2 , t 2 ) = f K(2; l )ip(x 1 , i t ) d 3 x v t 2 > t 1 . (30) 




The connection with the main text is contained in the question: what 
differential equation does \p(x 2 , t 2 ) satisfy? We answer this question with the 
help of a specific example. 

Consider a particle of mass m moving in one direction, the ^-direction, 
say. Let V(x, t) be the potential of the particle in some external field. Then 
for a given path f 21 we have 



r 1 2 

ri 0 1 

SIAJ = 

1 

—mx 2 — V(x, t) 

_ m 




Writing t 2 = t 1 + e, we can express (30) in the form 

oo 

Ip(x, t 1 + e) = f K(x, t l + e; x — 17 , t^xp^x - ?], r x ) dx\. (32) 

— 00 


Taking e small, all nonrelativistic velocities x for the particle are contained 
in a range of rj of the order e. Hence to the first order in small quantities, 
the contribution of the potential term in (31) is — e V(x, C)- To this order 
(29) gives 



The path integral in (33) is just the free-particle propagator, the evaluation 
of which forms the basis of calculations using the path-integral method 
(Feynman and Hibbs, 1965, p. 42). In the present case this evaluation gives 




K(x, t x + e; x - 77 , t x ) 
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Expanding the wave functions in (32), 

dip 1 d 2 \p 

i(x - 1 ), t x ) = \P(x, t x ) - 11— + —T ] 2 -— + ..., (35) 

d\p 

1 1 + c) = *p(x, / x ) + e — —(36) 

ot 



where xp and its derivatives and V are taken at x, t v For convenience in 
evaluating the righthand side of (37), the range of 17 is taken from —00 
to 00 . This is permitted because of the convergence produced by the rapidly 
oscillating exponential. In lowest order (37) is satisfied identically. The term 
of first order in 77 evidently vanishes, and integration of the 17 2 term yields 
a quantity of order e, which, on being equated to the e term on the lefthand 
side, gives 



z 

ti 


V\p + 


ifi d 2 \p 
2m dx 2 



This is the one-dimensional Schrodinger equation. The same argument can 
readily be extended to obtain the three-dimensional Schrodinger equation. 


APPENDIX 3 (a note to §4.7) 1 

§A3.1. A Path-Integral Method Leading 

to the Perturbation Expansion in 
Terms of the K + Propagator 

We begin with the free-particle case. Define two propagators by 

/f 0 + (2; 1) = 0(t 2 - /,) 2 u n { 2)« n (l), 


1 Adapted from Hoyle and Narlikar, 1971. Equation (39) here was (30) there. 
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K 0 -( 2; 1) = -0(t, - t 2 ) ^ u n (2)u n (l). (39) 

n 


Both satisfy the inhomogeneous Dirac equation 

(t 2 + im)K 0 H 2; 1) = 5 4 (2, 1), (40) 

and both are confined within the light cone at point 1, K 0 + ( 2; 1) being 
nonzero only for points 2 inside or on the forward half of this cone, and 
K 0 ~( 2; 1) being similarly confined to the backward half of the cone. 

Corresponding to these propagators we distinguish two types of path, C + 21 
going forward in time from point 1 to 2 ( t 2 > r r ) and a path C - 21 going 
backward from 1 to 2 ( t 2 < t x ). Both are monotonic with respect to time. 

To define amplitude along C + 21 divide t 2 — t A into a large number of 
small subintervals at intermediate points X t , i — 0, 1,. .., N. Let i — 0 
correspond to point 1 and i = N io point 2. Define P(T + 2 i) as 

N 

II Wft i - 0, (41) 

i = 1 

for N large. We shall discuss whether or not N should tend to infinity later. 
The amplitude P(r~ 21 ) for af' path is similarly defined with a chain of 

K 0 ~(i ; / — 1) propagators. Clearly the paths C + 21 and C~ 21 must be timelike 
throughout. 

The measure for the sum over paths is so defined that 


V(2; 1) = 2 p ( f± n) 






Suppose we study a free particle in the four-dimensional slab t 1 <t< to. 
At t = t x paths coming from the past represent an amplitude from some 
previous history of the particle. We denote this amplitude by \f/ + . Similarly 
at t — t 2 paths coming from the future give an amplitude which we denote 
by \p_. Thus the amplitude at a point 3 within the slab is given by paths 
r + 31 starting at points 1 on t = t x and weighted by ^ + (1), together with 
paths r ~ 32 starting at points 2 on t — t 2 and weighted by \p_(2). That is 
to say, the amplitude \p( 3) is given by 

<H3) = J K n + (3; 1)y 4 ^ + (1) d 3 x x - f K 0 ~( 3; 2)y 4 ^_(2) d 3 x 2 . (43) 
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If * p + is made up of positive-energy solutions u n at t = t v then \p + for 
t > t 1 is always made up of the same positive-energy solutions. Similarly 
if \p_ is made up of negative-energy solutions at t — t 2 , then \p_ for t < t 2 
is always made up of negative-energy solutions. We impose this property 
as a boundary condition. Then (43) is equivalent to 

*(3) = f K + (3 ; l)y 4 ^ + (l) d 3 x t - f K + (3; 2)y 4 ^_(2) d 3 x 2 . (44) 


Suppose now that we are given an external potential B t nonzero in 
t x < / < t 2 but zero outside this range. This does not interfere with the 
boundary condition just discussed. We now admit that within the slab 
t x < t < t 2 interaction with B i can turn the time-direction of a path. Thus 
paths within the slab can be zigzag with as many reversals as we like, whereas 
paths outside the slab are monotonic with respect to t. 

The amplitude for a f + 21 path is defined by 


P B (r + 21 ) = 


P(f + 2l) eX P 




and that for a f V2 path (point 1 on t ■= t v point 2 on t — t 2 ) by 


P B (r~ 12 ) = 





Paths within the slab need not be monotonic with respect to however. 
Suppose we have a path from 1 to 2 with In reversals. Denoting intermediate 
points by /, sections between successive reversals are monotonic, and the 
amplitude is given by 


pB ( r 21) = n < 4r > 

1 

where the + sign holds for forward-going sections and the - sign for the 
backward-going sections. 

Clearly it is also possible to have paths starting at t 1 and ending at t v 
In this case we get a at t = t x even though there may be no \p_ at t = t 2 . 
This does not happen when B i = 0. Similarly we could have a ^ 0 at 
tn originating solely from \p_ at t 2 , because ol reversals by B { . 
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We now wish to calculate the following problem. Given \p + on t = t 1 and 
= 0 on t = / 2 , what are on t = g and \p_ on / = t^l 
Consider the evaluation of ^ + (/ 2 )- Formally we have 

4 + B (2) = if P B (J' 21 )y 4 4'+0)® 3r 12 d 3 x v (48) 

which includes paths with reversals. To evaluate (48) consider the paths 
according to the number of reversals, beginning with paths C + 21 without 

reversals. 

Divide the time-range t 1 < t < t 2 into a large number of small intervals. 
Consider one such interval ( t , t + c). Let the path C + 21 intersect the time- 
sections at t and / -h € at spatial points x, y\ respectively. Since C + 21 is 
timelike, \y — x\ < c. Expand the exponential factor in (45) in a power 
series. The unity term gives the usual free-particle contribution. The linear 
term gives 


le 


^ if f [ B 4 (X , t) -(y - X)- B(x , t)/e]P(r + 2 i)y i 4'+( 1 ) °P 3 r + 2l d‘‘x v 


all 
intervals 


(49) 


Now P(r + 21 ) can be written as a product of three factors 


p(r + 21 ) = P(r + 2y )P(r + vx )P{r + xl ). ( 50 ) 

The sum over T + 21 is equivalent to sums over r + 2y , r + yx , r + xV together 
with integrations over x, y. The sum over r + xl propagates the positive- 
energy part of the wave function at 1 to x. This gives ^ + (x, /). The sum 
over P(r + yx ) gives y, t + €; x, t). The sum over P(r + 2y ) similarly gives 
K 0 + ( 2; y, t + e). The latter function propagates only the forward-going paths 
at t + e, and the weight of these paths is given by the positive-energy part 
of the wave function at (y, t + e). In general, because of B i in (t, t + c ), 
there will be a negative-energy part in the wave function at t + €. This 
negative-energy part gives the weight of backward-going paths "reflected" 
by the step (/, t -f e). The negative-energy part does not affect \p + (2). We 
ensure that only the positive-energy part of the wave function contributes 
by using K + (2; v, t -h €) instead of K 0 + (2;y, t -h c). So we get for (49) 

-ie X 1 f f K +( 2 ’)’’ 1 + Oy 4 [£ 4 (*, 0 -(>'-•«)• B(x, t)/i] 

• K 0 + (y, t + «; .v, t)y 4 xp.(x, t) d x d 3 y. (51) 


Because |y — x 


< c, we can replace K+(2,y, t + e) by K + ( 2; x, t) to within 
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a term that tends to zero with €. Using (39) we have 


K 0 + (y, t + t; t) = 2 U n (y)U n (x)e- i ^ 


n 


= 5 3 (>’» *>Y 4 - k X H (y) U n(y) U n( x ) + °< e ~)’ (52) 

n 

where H is the Hamiltonian and 


u n (x, t ) = U n (x)e- iE nK 

Thus (51) becomes 

-*2‘/ K+(2; x, t)y 4 B 4 (x, l)tp + (x, t) d ’x 
-ie 2 /£ f f A '+<2; *, 0 y 4 ( >’ - x) • B(x, t) 



2 H(y)U n (y)U H (x) 


yyl'+ix, t) d 3 x d 3 y. 



Now for a free particle 


H = — /a • V + my 4 

and the my 4 term makes no contribution to the second part of (54). The 
only portion of this second part that survives as e —0 is that which arises 
on differentiating y — x, using integration by parts with respect to Fy from 
H(y). Since 2 n U n (y)U n (x) — 8 3 (y,x)y 4 it is easy to see that (54) is just 

K + ( 2; x , t)$(x, t)$+(x 9 1) d 3 x. (55) 

For small € the summation can be replaced by integration with respect to 
/. Denoting (x, t) as point 3 we therefore obtain 

-ie f K + ( 2; 3)$(3W + (3) dr 3 , (56) 

the integration with respect to dr 3 being over the four-dimensional slab 
t 4 < t < t 2 . 

The second-order term in the expansion of the exponential in (45) can 
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be worked out in the same way and gives 



K + (2- 4)#(4)K + (4; 3)4(3)* + (3) dr 3 dr 4 . 



and similarly for higher-order terms. The appearance of K + propagators 
after each $ can be understood in the following terms. After each $ scatter¬ 
ing the wave function appears in a general \p + + \p_ form. Since we are 
propagating along af + path we must use only the \p + part. That is to say, 
we need to act on But this is the same as K + acting on -f \p_. 

Consider now paths with reversals. Evidently to arrive at / = t 2 from t = t 1 
we must have an even number of reversals. It will serve to bring out the 
main features if we treat the simplest case of two reversals. Consider a path 
that goes from t 4 to t 3 > t v then reverses to t 4 < t 3 , and reverses again to 
t 2 > t 4 . In order for paths of this kind to contribute to the amplitude the 
potential must be nonzero at t 3 and t 4 . Again dividing t 1 < t < t 2 into small 
steps, the integral f B t da 1 is replaced by a sum over steps. This integral 
appears squared in the second order of the perturbation expansion, so that 
in this order we have a double sum over steps. The double sum contains 
a member that refers to the steps (t 3 — c, t) and (7 4 , t 4 + c). This member 
gives the lowest-order contribution that the path in question makes to the 
total amplitude. 

We have to sum over all paths from t x to t 2 with reversals at t 3 and t 4 . 
Since for such paths the particle is free (in the second order of the pertur¬ 
bation expansion) between t 1 and t 3 — c, it follows that \p + at t 1 is carried 
into a i p + wave function at t 3 — e. Within the thin slab (t 3 — e, t 3 ) scattering 
by B t changes the wave function into a xp + + \p_ combination at t 3 . Since 
the paths are now backward and the particle is again free (second order 
of the perturbation expansion) between t 3 and t 4 + €, only the xp_ part of 
the wave function at t 3 is carried to t 4 + c. Between t 4 + e and t 4 there is 
a further scattering by B { , and this again sets up a xp + + \p_ combination 
at t 4 . The paths now go forward to / L >, and since the particle is once again 
free only the \p + part of the wave function at t 4 is carried to This “ac¬ 
counting” procedure is rather complicated when the K {) ± propagators are 
used, but the K + propagator automatically keeps a correct record at each 
reversal. It is therefore preferable to use K + for the sections in which the 
particle is free. 

Summing over all paths that reverse at times t 3 , t 4 (t 3 > t 4 ) in [t v t 9 \ we 
obtain 
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< 


ie)2 If 


K + ( 2; 4)$(4)K + {4\ 3)$(3^A3) dr 3 dr 4> 


(58) 


h>U 


and (57) and (58) together give 


( 


ie) 2 f f K 4 (2\ 4)$(4)K + (4; 3)ft(3))p + (3) dr 3 dr. 


(59) 


as the second-order term of the perturbation expansion. 

In this last step we are assuming that the case t 3 = t 4 gives no contribution 
to the amplitude. This case can be dealt with in the following way. When 
the path first reverses at t 3 , the xp_ part of the wave function is of order 
€, being proportional to the thickness of the slab (t 3 — c, t 3 ). When this \f/_ 
part is scattered again by the same slab, the resulting \p + part of the wave 
function is of order e 2 . This \p + part then goes to t 2 . We have a similar 
contribution for each thin slab, but since there are only of the order of € _1 
such slabs, the total "double scattering" is of order e and tends to zero 
with €. 

to give 

*+(2) = I K + B ( 2 ; Oy^+O) d 3 x v (60) 


The above discussion is easily generalized 


where 

K + B (2; 1) = K + (2\ 1) - ieJ'K + ( 2; 3)$(3)K + (3; l)dr 3 

-f (— ie) 2 if K + (2; 4)$(4)K + (4\ 3)$(3)K + (3; 1) dr 3 dr 4 + .... (61) 

A similar analysis leads to 

f_(l') = f K + B (V; 1 )y 4 ^ + ( 1) d 3 x v (62) 

where K + B ( 1'; 1) is given by replacing point 2 in (61) by point T, the latter 
point being on f = t v 

The same procedure for (61) as that used for (16) leads easily to 

[f 2 + ie$ (2) + im]K + B ( 2; 1) = 5 4 ( 2, 1). (63) 

Equations (60) and (62) give the solution to the problem we set out to solve, 
and (61) is the required perturbation expansion. 
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§A3.2. Many-Particle Interactions and the 

Generalized Response of the Universe 

We begin by restating the conclusions of the preceding section in a slightly 
different notation and in a somewhat more general form. 

We now describe the slab under investigation by 0 < t < T instead of 
by t x < t < t 2 . The planes t =■ 0, t = T will be referred to as the surface 
S and points 1, 2,... , T, 2',... may be anywhere on S. The work of the 
previous section is described by 

= fK + B (V-, l)Mn(l )dS v (64) 


Here we introduce the concept of ingoing and outgoing parts of the wave 
function at the surface S. In the notation of the previous section xp in is a 
wave function on t = 0 and is a \p_ wave function on t = T, while \p ont 
is a ip + wave function on t — T and a \p_ wave function on t = 0; n { is the 
unit inward normal to S , and dS 1 is the surface element at point 1. 

Most one-electron problems in quantum electrodynamics are concerned 
with calculating \p out when xp in and the external field B { are given. 

Consider the following generalization of (64) to the many-particle problem 


*out< 1 A--.) 






.. ^ (1, 2,...) dS l dS. 2 



The wave function T in now has four components for each particle and has 
a point 1,2,... for each particle. Similarly T out has four components and 
a member of T, 2',... for each particle. The matrix K + B ( 1'; 1 )yl l acts on 
the components of T in for the particle at point 1; K + B (2'; 2)fl 2 acts on the 
components for the particle at point 2; and so on. The order in which these 
matrices are written is irrelevant. 

This generalization is valid for particles acted on by an external field, 

but is invalid when there are mutual interactions between the particles. Then 
we write 


2 ',...) 


out 


If f K{\\ 2',... ; 1,2,.. .)^i^2 • • • 'J'inO* 2,. ..) dS x dS 2 ... 


( 66 ) 


in which the many-particle propagator K( T, 2',... ; 1, 2,...) is determined 
in the following way. 
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We recall that 



The multiple-path integral for a many-particle system in an external field 
B, is 

K(V, 2 ',...; 1 , 2 ,...) 



/ f P(r vl )P(r 2 , 2 ) 







and this separates into the product K + B ( T; 1) K + B (2 ; 2)... as stated above. 
But when the particles are mutually interacting (68) is changed to 


K( T,2',...; 1,2,...) = //-/ p{r vl )P(r 2 , 2 )... exp(/7?) 




da 1 + 




in which the factor exp(/7?) represents interparticle action. In general exp(iR) 
takes such a form as to prevent (69) from being separable. The particles 
are here denoted by a, /?,..., and da t , db i9 ... are coordinate displacements 
along the paths r rv r 2 , 2 ,..., respectively. 

We now aim to prove that all the results of the usual theory of quantum 
electrodynamics follow from choosing the classical time-symmetric action 
for R , 


ab) da i dbi ■ ■ ( 7 °) 

a b ^ n * 2'2 

Before proceeding, however, we pause to note that if T in (1,2,...) is 
antisymmetric with respect to every pair of particles, then so is T out 
(T, 2',...). This can be seen from (66), (69), and (70). The exclusion principle 
has a rather mysterious role in the usual formulation of quantum electro¬ 
dynamics. The present path-integral approach throws some light on its 
meaning. It prevents the paths of different particles from sharing common 
points. The paths r rv /V 2 , cannot cross, for example—more accurately, 
paths with common points make zero total contribution to the amplitude. 
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Paths for the same particle, on the other hand, can have common points 
or common segments, because paths for the same particle are alterna¬ 
tives—the particle follows one of its possible paths. It is indeed this property 
that permits us to say whether two paths belong to the same particle or 
to different particles. 

Returning to the multiple-path integral (69), the electromagnetic factor 
is the exponential of i times 

J B i dai ~ _ 

a a b 

the line integrals being over the paths r ri , r 2 , 2 , .. ., for the particles a , 
6,..., respectively. The paths are segments lying within the slab 0 < t < T. 
There are interactions with paths outside the slab, and it is these interactions 
that give B t . There are contributions to B t from both t < 0 and t > T. The 
interaction with the future arises because paths interact through the time- 
symmetric function 8(q 2 ). The external field from t < 0 has already been 
discussed in detail in §4.3, so we shall not repeat the discussion here. That 
is to say 





t < o 


+ B 


t > T-> 


and we take 



t < o 





The first term of (71) is then the response of the universe. 

Apart from containing terms with a = b, which we shall discuss at the 
end of the present section, (71) is the same as the classical action in the 
direct-particle theory. It is interesting to recall the classical treatment of the 
action, since the classical treatment and the quantum treatment will turn 
out to have similar features. First we write 




2 e af A \a) da i< 


where 


A\ a) {X) = V e, J S(q 2 XB ) db\ 

b a 


( 74 ) 
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Separating 8(q 2 AB ) into advanced and retarded parts. 


^ W 2 ab ) 


1 


^ I X A X B 


f B 


X A ~ X b \) 


+ 8(t 


A l B 


t Ft 4 " 


- *«l)], (75) 


and writing 


A \a)(X) =jlAaP Oret + AaAUl 


(76) 


Aa)(X\et = S Am (X) 

b a 


ret 


s 

b a 



X 


X 



X — X 

: B 



db\ (77) 


Ai axi dv = 2 = y 


b a 


adv X , '"b 
b^L a 



S(t 


X B 


f Tt + 


* B I) 


X — X 


B 


db\ (78) 


enables the classical action to be written in the form 


S e a f B\>rd a i 2 2 e a J l^\a)re t + ^(a)adv]^i* 0$) 

a a 

The classical response condition is 

Bi(X \> T = y 2 Mi <a> (^)ret - ^Wadvl- (80) 

a 

Eliminating B\ >T between (79) and (80), we get 

~Z, e af { Aa, ret + J M i<a, ret ~ ^“'advl) A. (81) 

The first term in the curly brackets is the usual retarded classical field of 
all particles other than a , and the second term is the usual radiation reaction 
on a. 

Our aim is to obtain a generalized response condition for the quantum 
case, analogous to (80), that together with (71) leads to an electromagnetic 
phase factor in (69) giving all the results of the usual theory of quantum 
electrodynamics. 

The quantum case differs from classical theory in that conjugate paths. 
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a'(t) for particle a , b'(t) for particle b ,..., must be introduced. These con¬ 
jugate paths go from points 1*, 2*,..., on S to the same points T, 2 ,..., as 
do the paths a(t), b(t) .... Instead of (71) we now have (71) minus the 
contribution of the conjugate paths. It is easy to see that this combination 
of paths and conjugate paths gives 



in which 


^Wret = 2 

b 

^Wadv = 2 e b 

b 

and where the primed fields refer to the conjugate paths and are given by 
expressions similar to (83), (84), and B'\ > T is the response to the conjugate 
paths. Multiplying (82) by i and taking the exponential gives the electro¬ 
magnetic phase factor. 

The first integral in (82) cannot be expressed in a form suited to practical 
calculation except in association with the second integral. That is to say, 
we do not know the response of the universe except in a probability calcula¬ 
tion. The usual theory of quantum electrodynamics, on the other hand, 
allows a complete calculation of the amplitude alone, irrespective of whether 
a probability is taken. Since amplitudes do not appear in physical statements, 
the usual theory seems unsatisfactory in this respect. It is true that physical 
statements can be made by discarding the phase of the amplitude, but then 
why go to the trouble to calculate the phase if it is not required? 

The slab 0 < t < T now has an explicit meaning. It is the slab in which 
an experiment is performed, i.e., for which we require a physical statement. 

Paths and conjugate paths together permit a separation of positive and 
negative frequencies. In classical theory there is one path for each particle, 
and this one path contributes both positive and negative frequencies. In 
quantum electrodynamics paths give positive frequencies and conjugate 
paths give negative frequencies. Negative and positive frequencies are 



8(t 


x 


*B + 




B 


db\ 


(84) 
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introduced in the following way. Define 




ty t r> A -V o I ) 



(85) 


b 


^Wadv± =Z e »f 


b 



( 86 ) 


where 



(87) 


It may be noted that whereas the sum of A \ et+ , y4* ret _, is a vector, namely, 
the classical retarded field 


( 88 ) 


— At _|_ Jt 
ret ^ ret+ ' ^ ret—’ 


ret+ 


A\ ei+ , A\ et _ are not vectors by themselves. Similarly 


(89) 



is the classical advanced field, but A\ dv+ , A i adv _, are not vectors by them¬ 
selves. 

This separation into positive and negative frequencies permits two new 
vectors to be defined, A i ret+ — A\ dv+ and A\ et _ — A\ dv _. These vectors 
appear in quantum electrodynamics but not in classical theory. It is 
straightforward to show that 


A\X) 


ret+ 


^(^Q a dv+ 


X e b f s +( c / 2 xb) dt>i - f S_(c/ 2 BX )db l , (90) 




X e b f 8_(q 2 XB ) db l — j' 8 + (q 2 BX )db l . (91) 



Although 8 ± {q 2 ) are invariant functions, it is not immediately clear that (90) 
and (91) are vectors, because of the time restrictions. However, the contri¬ 
butions of the principal parts in 8 ± (q 2 ) have opposite signs. It is easy to 
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see therefore that the time restrictions do not apply to the principal parts, 


which accordingly give vectors. The time restrictions continue to apply to 
the 8(q 2 ) part of 8 ± (q 2 ), but since 8(q 2 ) is confined to the light cone, the 
restrictions are invariant, and the 8(q 2 ) part also gives vectors. 


The quantum response corresponding to (80) is 


Bi(X) t>T 



As stated above, the paths carry the positive frequencies and the conjugate 
paths carry the negative frequencies. If we imagine a coalescence of paths 
and conjugate paths (92) becomes the same as (80). To obtain an equivalent 
condition for B i '(X) t>T we interchange paths with conjugate paths and 
positive frequencies with negative frequencies. This double interchange 
leaves (92) unaltered, so that 


B i '(X) t>T = B i (X) t>T . 


(93) 


To complete the work of the present section we calculate the contribution 
to (82) from particles a, b. For a ^ b we have four paths a, b , a \ b\ which 
combine in the pairs (a, b ), (a', b '), (a\ b), (a, b '). The contribution of (a, b) 
has two terms, 



(94) 


from the time symmetric interparticle action and 

IWab) - W AB )} da i db l 




(95) 


from B x . Since 


2 S (Tab) = 8 +(q 2 AB ) + 8 ± (q 2 AB ) = S ± (q 2 BA ), (96) 


the sum of (94) and (95) is 



( 97 ) 
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The pair (a\ b') similarly give 


+ e a e b f f 8 -{q 2 A’i?) da i db'K 


(98) 


Only the B t , B t ' terms in (82) contribute to (a’,b). It is not hard to show 
that these terms combine to give 


e a e b 


f f W A , B ) da,'db* -ff 8_(q 2 BA ’) da/ db* 

t A' > ts t B > t A ' 


(99) 


The pair ( a , b r ) gives 


a e b [/ f 8 M 2 Ba) da ' db i - ff 

^ B' ^ t A tA ^ ^B' 


8-(q 2 AB') 1 dbi 


. ( 100 ) 


The total contribution of the particles a , b, to the electromagnetic phase 
factor is given by summing (97)-(100), multiplying by /, and taking the 
exponential. 


exp \ le 


l A' ^ l B 


8 M 2 A’b) da i db 


2 I I £ {q 2 

^B > tA' 



S-(r B A') da i db 


+ 


f f b +(q 2 B' A ) dc,i db i - ff 8 M 2 a&) da ' db i 

t A > t B ’ 


^B’ > ^A 


f f 8+(q 2 AB) dc>i db i + f f 8 M 2 A'b) da i db ' 1 


( 101 ) 


This is the influence functional F[a, b\a\b'] contributed by the particles 
a , b. It can be expressed in terms of three-dimensional Fourier integrals 


using 


<5 ± (<7 2 2i) = ±4ni f -f- exp[ 


‘K\t 2 - 


ik-{x 2 - 102) 


where K = |A| and q 2 21 is the invariant distance between points 1 and 2. 


We get 


F[a,b;a\b') 


exp 


(fafb r 

\ 477 2 J 



di2 KdK 


(// 


txv[-iK\t A - t B 


■P /A 


B 


x ,)] da, db 1 
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+ f f exp[/A'|/ 4 , 

- f f exp [iK(t A 
-If exp [iK(t B 



"f" ik * 


5') “1“ lk # 

- (O + • (*!' 


— *£,)] <7#/ db' 1 

— x A )] <7^ db' 1 

— x B )] db 1 



(103) 


The different terms in (103) can be interpreted as follows. For t A > t B 
the positive terms in the curly brackets contribute to a downward transition 
of particle b and an upward transition of particle a , and vice versa for t B > t A , 
whereas the negative terms contribute to downward transitions of both a 
and b. Thus (103) refers to absorption and stimulated emission. Since the 
coefficient in front of all terms in the exponential is the same, the proba¬ 
bilities for these two processes are the same. The influence functional (103) 
leads to all the usual results for the interaction of two charged particles. 
There is a similar factor in the multiple-path integral (69) for every pair 
of particles. 

Consider next the case a — b. There are contributions from three path 
combinations (a, a'), (a, a ), and (a \ a'). These may be obtained in the same 
way as in the above discussion, or by identifying a and b , a' and b\ in the 
above formulas. However, if we follow the latter course a factor \ must be 
introduced, because of the \ in (70) and because a and a' interact only once. 
The result is 


F[a , a'] — exp 


{ ie a 2 // &+(q 2 A'A) da i da 



8_(q 2 aa ') da^ da 


t A > t A . 


\ 

2 


J f S +<3 


2 


1 


AA 


) da 1 da f + j f f s -(q 2 A 'A') da ' 1 da i 


In terms of Fourier integrals the corresponding result is 


F[a> a'] = exp 




00 


kdK 


ff exp [iK(t 


* A’) 


+ ik • (x A , — x ,)] da 1 da:’ 



+ I I exp[/A(/;f 

‘a >‘7 


1.) + ik ■ (x 


x ,)] da 1 da. 



+ I I exp [iK(t A , 

‘a- > tX’ 


t A .) + ik • (xr. 


x^,)] da ' ’ da A 


• (104) 


. (105) 
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The first term in the curly bracket gives spontaneous transitions, whereas 
the second and third terms contribute “radiative correction effects.” There 
is a factor of the form (105) in the multiple-path integral (69) for every 
particle. 

For many purposes it is preferable to express the 8 ± (q 2 ) functions in (101) 
and (104) as four-dimensional Fourier integrals, using 


W) 


4tt 



1 


o+ 


k 


p—ik'{x 2 —x t ) 


d 4 k 


it 


(2 77 ) 


4 * 


(106) 


We come now to the problem of the self-action contributions to (71). The 
contribution of particle a is 



(107) 


where A, A , both lie on the path a(t). In classical theory (107) is omitted 
entirely; particles do not act on themselves. It is actually sufficient in classical 
theory to make a weaker hypothesis—that sufficiently small steps of a path 
do not act on themselves—for classical paths are wholly timelike, and 

q 2 ^ > giving 5(< q 2 A ^) = 0 when A and A are separated. 

In quantum theory, on the other hand, there is a difference between 
omitting (107) and making the hypothesis that individual path steps do not 
act on themselves, since the latter hypothesis still permits different r + and 
r~ sections of a path to interact, whereas a complete omission of (107) would 
remove all self-interaction. In the above work we include (107) in order to 
obtain the usual electron-positron interaction. If there were no self-action, 
the response B i t>T would contribute the exponential of 

- Y ie a f M <a, iret + “ ^iadv+1 da * 


j(a) sjni _|_ ip 

™ iret+ aa ' 2 “ 

to the (a, a) term in (105). The second term in (108) is minus the self-action 
contribution (107). It is canceled therefore when (107) is included, so that 
the first term of (108) yields the (a, a) part of (105). If (107) is taken to be 
zero for timelike paths, the second term of (108) is also zero, and we still 
obtain (105), even though self-action is absent. 

It seems possible that in quantum theory (107) may vanish for r + or 
r~ paths individually, even without requiring the hypothesis that individual 




iret+ 


+ ^ < “ > iadv+]^ i ( ,08 ) 
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path steps do not act on themselves. Suppose A and A are on the same step, 
da 1 = da \ Since quantum paths are made up of null segments, q 2 a j = 0, 
but this does not cause difficulty so long as A and A are distinct, since 
da i da 1 is proportional to q'XA an d q 2 aa aa) — 0- In general for an individ¬ 
ual r + or r~ path, either we have q 2 a j > 0, Siq 2 ^) = 0, or the path be¬ 
tween A and A is a straight null line. So long as A and A are taken to be in¬ 
terior prints within da 1 , dd\ we have da t da 1 proportional to q 2 ^x anc ^ 
S(q 2 A2) da { dd l is again zero. 

The self-action integral becomes ambiguous, however, when A and A are 
permitted to coincide. This ambiguity is removed if we omit self-action on 
a sufficiently small scale. We shall find in §A3.4 that this question is inti¬ 
mately connected with divergences in radiation correction processes. 


§A3.3. Vacuum Loops 

The amplitude for any process in quantum electrodynamics can be cal¬ 
culated from a perturbation expansion, provided a suitable factor, say, C v , 
is included to represent vacuum loops. In §A3.1 we obtained the perturba¬ 
tion expansion for a single particle in an external field by the path-integral 
method. In §A3.2 this method was extended to a many-particle system, and 
the mutual interactions of the particles were correctly represented by com¬ 
bining the classical electromagnetic action with the response of the universe. 
This is the position we have reached. To complete our program it remains 
to return to the problem of vacuum loops and to work out the factor C v . 

We show first that C v can be obtained by applying an antisymmetrization 
process to the perturbation expansion (61) for a particle in an external field. 
Since (61) was obtained from a path integral, it is necessary to show next 
how the extra terms required by antisymmetrization can also be represented 
by a path integral. This one-particle treatment must then be generalized 
to many particles and must include interparticle action. We proceed in these 
three stages. 

The idea underlying the antisymmetrization of (61) is illustrated in general 
terms in Figure 4.5, p. 76. In (a) we have two particles going forward 

in time, one particle going from 1 to 3 by a r + path, the other from 4 to 
2 also by a f + path. The amplitude for (a) does not represent the total 
amplitude for there to be a particle at 2 and a particle at 3. The exclusion 
principle requires subtraction of (b), with 1 going to 2 and 4 to 3. 

In (c) we have a single-particle path with two time reversals, and with 
an external field B i acting at the points 3 and 4 of reversal. The configuration 
of points is the same in (c) as in (a), and the electron paths from 1 to 3, 
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4 to 2 are the same. If now we follow the suggestion of the preceding 
section—that time reversals generate “new” particles—we require the r + 
sections from 1 to 3, 4 to 2 in (c) to be on the same footing as the same 
two paths in (a). We should then subtract the amplitude for the possibility 
that the r + paths starting from 1 and 4 go to 2 and 3, as they do in (b), 
instead of to 3 and 2. The effect of this redirection of the paths is shown 
in (d). We now have a single T + path 1 to 2 and a disconnected closed 
loop with a T + path from 4 to 3 and a r~ path from 3 to 4. The external 
field acts at the two points of reversal of the loop, not on the r + path from 
1 to 2. We shall next give quantitative expression to this idea. 

The amplitude for (c) is given by restricting the second-order term in the 
perturbation expansion to t 3 > t 4 . 



Since 1 to 3 is a r + path, (109) operates on a \p + wave function at point 
1. Writing the wave function as l> + ( 1) and integrating spatially with respect 
to 1, the amplitude for the double scattering of (c) is 



The spatial integration with respect to 1 is over the 7 = 0 face of the slab 
0 = t x < t < t 2 = T, the integrations with respect to dr 3 , dr 4 , are taken 
through the slab, and point 2 is on t — T. Since t 2 > t 4 we have 


K + ( 2;4)= V u n (2)u n (4), (111) 

E n >0 


and we can write (110) in the form 

(-ief V «„(2) f f U n (4)$(4)K + (4;3)$(3)v + (3)dj 3 d u . (112) 

E n > 0 t 3 >t 4 

Next we have to subtract the amplitude for (d). This is 


-(-ief 2 y + (2 )// 

E n > 0 H'* * 4 



(4)$(4)K + {4- 3)$(3)u n (3) dr, dr 4 . 



The sum of (112) and (113) is antisymmetric with respect to interchange 
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of u n and v +9 as was required by the qualitative discussion based on Figure 
4.5. 

Now we use the matrix relation 


2 iI n (4)$(4)K + (4; 3)$(3)u n {3) 


spins 


Tr 


^ u n (Z)U n (4)$(4)K + (4- 3)0(3) 


(114) 


spins 


together with 


K + ( 3; 4) = V u n (3)u n (4). 


t, > t. 


(115) 


K> o 


and obtain for (113), 


( -ie) 2 v + (2) f f Tr[AT + (3; 4)0(4)tf + (4; 3)0(3)] dr 3 dr 4 . (116) 


t 3 > 1 4 


The restriction t 3 > t 4 can be removed provided \ is introduced, and the 
resulting coefficient of v + (2) is 


j(~‘ e ) 2 ff Tt[K + (3; 4)j(4)K + (4; 3)0(3)] dr 3 dr 4 . 


(117) 


Apart from the unity term, this is the lowest-order term in C v . Asym- 

metrization of higher-order terms in the perturbation expansion proceeds 
in the same way. In third order there are new terms in which the potential 
scatters v + and also acts twice in a closed loop. In fourth order the new 
terms scatter v + twice and act twice in a closed loop, act four times in a 
closed loop, and act twice in each of two loops. The net effect of all terms 
of the perturbation expansion is to give C v — exp( —L), say. 

Next we see how (117) and other terms arising from the perturbation 
expansion can be represented by path integrals. The loop r° is made up 
of r ± sections, as was the path T 21 in (47). The amplitude for a loop can 
be obtained from (47) by identifying points 1 and 2, and taking the trace 
of the righthand side, 


P B (f °) = Tr 


nw^i-i) 


i 


(118) 
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By summing loops in the slab 0 < t < T we obtain the path integral 

fp B (r 0 )® 3 r°. (ii9) 


The field B i is a simple exponential factor in each P B (r ± i i _ 1 ) factor on 
the righthand side of (118). Taken together these factors give the integral 
of round r°. We write 


P B (r°) — exp 



( 120 ) 


where 


P(f°) = Tr 


nw 


i , i 




( 121 ) 


It is remarkable that —L in exp( — L) — C v is just minus the simple path 
integral (119), 



where dl l is a coordinate displacement along r°. 

Equation (122) is the path integral, with the correct sign, for one loop. 
The path integral for n closed loops, /, m,..., is 




This may be compared with (68) for the particles a , 6,..., with the open 
paths r rv r 2 , 2 ,.... The factor \/n\ arises because the set of loops /, 
m,..., is counted n\ times in the summations. 

Before continuing with (123) we pause to note that the amplitudes P(r®) 
are determined in terms of the r ± i i _ x paths which were defined in §A3.1 
in terms of the K 0 ± propagators, whereas it is usual to involve the A7 
propagators. The use of the K + propagators is due to the trace property 
of P(r°). Thus the trace can be represented by a summation with respect 
to wave functions, 
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p { r°) = V a 




(124) 


where the summation is over normalized states for any momentum p. We 
require both E — ± \!p 2 + m 2 . The spacetime dependence of u cancels that 
of u because the wave functions are taken at the ‘'beginning” and “ending” 
of the loop, i.e., at the same point. Remembering that the r + sections of 
r° must be weighted by a \p + wave function, and the E~ sections by a ip_ 
wave function, the K + propagator is needed to keep correct accounting, 
exactly as in the derivation of (61). 

We have to generalize our path integral so that it includes both particles 
a, b ,..., and loops /, m ,..., and also so that mutual interactions are present 
as well as an external field. The path integral is to give the many-particle 
propagator K(\ \ 1; 2', 2; ...). The expression (69) for K applies when there 
are no loops. When there are n loops /, m,..., there is a contribution to 
K given by 

2-(-!)" / / P(fn)P(r 2 , 2 )... p(r°)P(r m °)... ex P [/(* + s)] 


X exp —1 


'(‘•i 

1 l'l 


B l da , 


+ 



/v 


2'2 


B l db l + ... + e(£ B % dl { + e <£ B l dm i + 

** r o ^v o 

1 l 1 m 


• ® 3 r ri ® 3 r 2 , 2 ... op3 r oq>3r m °. ... 


(125) 


The factor exp(/R) is the same as in §A3.2. The factor exp(fS) is new and 

gives the interactions between particles and loops and between loops and 
loops, through 



“ 2 e T 2 // 5 ( Tlm) dl' dm i . (126) 

l m 

The many-particle propagator is given by summing (125) for n = 0, 1, 2. 

The factor \ appears in the second term of (126) because the pair of loops 
(/, m) is counted twice. The particle a and the loop / are only counted once 
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in the first term. In the above formulas e is the electron charge, which implies 
that we are considering the electron vacuum, although even powers of e 
always appear in practical calculations, so the sign of e is irrelevant. 

The discussion now proceeds as it did in §A3.2. For every pair of particles 
we had paths a, b, and conjugate paths a', b', leading to the contributions 
F[a , b; a', b'], F[a, a'}, F[b, b% to the influence functional. For every pairs 
of loops (/, m) we now have closed paths /, m, and closed conjugate paths 
/', m\ giving contributions F[/, /n; /', m% F[l, /'], F[m,m']. These lead to 
vacuum diagrams—they have the effect of multiplying amplitudes by a 
phase factor exp iC , C real. Such factors disappear in the present theory 
because these contributions to the influence functional appear only after 
the response of the universe has been used, and we are then concerned with 
the calculation of a probability, not an amplitude. Loops by themselves are 
of no relevance. But the mixed contributions F[a , /; a\ /'] are of physical 
importance. They produce a scattering of particle a and contribute 
— 27Mc/sec to the Lamb shift. Using the method of §A3.2 we obtain 

F[a, /; a \ /'] 






- ff 8 M 2 AL) da> dl i + 



(127) 


This completes the program set out at the beginning of this section. We 
have now obtained the required influence functionals between particles and 
particles, between particles and loops, and the influence functional repre¬ 
senting the action of a particle on itself—the radiation reaction. All the usual 
results of quantum electrodynamics follow by applying the perturbation 
expansion to these influence functionals. 

We are now in an equivocal position, however, for just as we can obtain 
all the usual correct practical results of quantum electrodynamics, so we 
can obtain all the unwanted divergences in radiation correction processes. 
The critical question we now have to ask is whether these divergences are 
inevitable, or does the present direct-particle theory provide an escape from 
them, as it does in classical theory? We shall consider this question in the 
next two sections. 
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§A3.4. The Electron Self-Energy and the 

Radiative Correction to Scattering 

We come now to the analogue of the self-energy divergence in field theory. 
For a particle we have 



(128) 


which together with the response of the universe leads to the term 


- j e a 2 /f s +(q 2 aa) da* da t (129) 

in the influence functional associated with particle a. We discussed the 
interpretation of (128) at the end of §A3.2, and saw that to avoid ambiguity 
in the meaning of the line integrals points A and A should be distinct. In 
§A3.1 we defined the path steps by a set of points X if i — 0, 1,..., N. We 
left open the question of whether N should tend to infinity. If we decide 
affirmatively and arrange X i so that all steps tend to zero, A A cannot 
be avoided and the outcome is completely equivalent to the usual theory. 
We obtain all the usual divergences and require all the usual apparatus of 
renormalization theory. Here, however, we propose to examine the effect 
of preventing A — A. To preserve a four-dimensional notation we require 



i= 1,2, 3, 4, 


(130) 


where the \e l \ are to be regarded as small compared to the Compton wave¬ 
length. To make (130) physically meaningful, e l must be a vector associated 
with the particle. For a free particle we can try 



(131) 


where € is to be a length very small compared to m _1 . Thus we can choose 

e = 2 Gm, (132) 

the gravitational radius of the particle, giving me ~ 10“ 40 . Although we have 
not permitted the path steps to tend strictly to zero, the scale of the steps 
is nevertheless exceedingly small compared to any physical structures that 
have hitherto been considered. 

We emphasize that we are not seeking here to justify (130), (131), (132), 
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but to investigate their effect and to compare the resulting situation with 
the cutoff procedures of the usual theory. If indeed gravitation plays a critical 
role, then modification of the spacetime structure on the scale e is required. 
Only through such an investigation could one expect to understand c* more 
deeply. 

We show in this section that (130), (131), (132), lead to a finite self-energy 
even though the frequency in the three-dimensional Fourier integral of the 
8+(q 2 ) function tends to infinity. And ’n §A3.5 we shall show that the 
divergences in vacuum polarization, which are usually thought to be different 
in nature from the divergence of the self-energy, are removed by the same 
procedure. 

For simplicity of presentation we calculate first the self-energy for an 
electron with p = 0. Suppose at t = 0 we are given the wave function $l 0 
corresponding to positive energy m. We wish to examine the effect of the 
term (129) in the influence functional for the slab 0 < t < T. The wave 
function at point 2 on t = T is usually calculated from the perturbation 
expansion 



—imT 




2 


3i)% e imt * dr 3 dr 4 + 


• * • 9 


(133) 


where the integrations with respect to dr 3 , dr 4 , are over the slab. This permits 
points 3 and 4 to coincide, which is the situation we now wish to avoid. 
From (130) and (131) with p { — {—p, E ) = (0, m\ the evaluation of (133) 
is now subject to 




(134) 


To deal with the restriction (134) we separate the second term of (133) into 
two terms Q v Q 2 , such that t 3 > t 4 in Q 1 and t 3 < t 4 in Q 2 . 

For T > t 3 > t 4 > 0 write 


K + ( 2,3) 



1 


IF 

Eo> 0 ^2 


(y 4 E 2 - y * p 2 + m 


d 3 D n 

)(? —T ~ ^ + ’ ^' r 2 — *3 ^- 4 , ( 135 ) 

(277 ) 3 


K + ( 3; 4) 



1 


£>0 


2 E 


(y 4 £ 


dPp 

y • p + n 'i')e~ lE i t 3- t J + l P'^ x 3~ x 4 ) — — -, (136) 


(277) 
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where 

E., 2 = p., 2 + m 2 , E 2 = p 2 + m 2 . (137) 

The momentum representation for S + (q - : 34 ) is given in (102). Performing 
integrations with respect to d 3 x i , d 3 x 4 , d 3 p, d 3 p 2 , we get 

p, = 0, p - —k, (138) 

so that E 2 = m, E = V K' 2 + m 2 , and Q x becomes 



+ \)y i (y 4 E + yk + m)y i % 0 • 



p—imT+ i(m — E — K){t 3 — t 4 ) 



(139) 


Using symmetrical integration to remove terms linear in A*, is simplified 
to 



(140) 


where 



77 


/// 


2m - VK 2 + 


m 


2 


to> t 


Vk 2 + 


e il™-E-K)(t 3 -tJ K dKdt 3 dt r 


(141) 


m 


Similarly, we have 



(142) 


where 


2 


77 


i'Jf 


2m + Vk 2 + m 2 


tx> t 


3 


V7( 2 + 


m 2 


6> i(w + ^ t 4>A r dK dt 3 (143) 


It is usually argued that divergences in (9j, (U arise from high-frequency 
quanta, i.e., when K —oo. This is based on a calculation in which the 
integrations with respect to time are performed first. Now because of the 
time restrictions we integrate first with respect to frequency. For t 3 ^ t 4 the 
integrals with respect to K converge. To see this we examine the behavior 
of (141) and (143) at large K. Thus for K > m. 
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€i 


77 


fff (2m - K )e' - 2K>« 3 -1 4 > dK ^ ^ 


f 3 > 


77 



pimit } — 


<3 > <4 


00 

<4> J (2m - K)e~ 2iK{t 3-‘4>dKdt 3 dt 4 , (144) 


and 






2iKU 3-h> dK dt 3 dt 4 . (145) 


The actual values of fj. £ 2 , differ from these asymptotic values by finite 
quantities. 

Equations (144) and (145) can be simplified further by using the results 


oo 


A 7 ^ 0, 



lXx dx 


1 


00 


0 





xe lXx dx 


1 


o 


A 2 ' 


(146) 


Thus 


fi 


77 


77 


1 s,im(to-tJ 

— im 

1 

1 

1 c J * 

<3>*4 ' 

1 

CO 

_1 

4(r 3 

- / 4 ) 2 -1 

1 & im(tr,-t d ) 

im 


1 

1 c d 4 

t 3 <t 4 

1 

CO 

_1 

4^ 

CO 

-r 4 ) 2 J 




dtdt 4 . 


(147) 


(148) 


Interchanging the naming of / 3 and / 4 in £ 2 we have 


Ci + C; 


2^ 2 


77 



*3>*4 


cos m(/ 3 — / 4 ) sin w(/ 3 — / 4 ) 
im - : -|- / - 


^3 U 


4(^3 ^4) 


dt 3 dt 4 


lie 


77 


_ mT 

f dx f 

cos y 

sin y" 

+ 

L J 

0 rac 

V 

y 

4 v 2 J 


dy, 


(149) 


where 


x-mt 3 , y = m(t 3 -t 4 \ (150) 

and we have now taken account of the restriction |/ 3 — t 4 \ > €. 

For a slab with 1, which is the situation usually considered, we 

have 
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For me < 1, the main contribution to (151) comes from y < 1 when the 
trigonometrical factor ~ — f, and 

+ i 2 ~ -iAT, O 52 ) 


where 


V 2 

A — - m In (me). 

277 


(153) 


Since we are considering paths to have finite time steps e, it would be 
a stricter procedure to divide the whole time-range [ 0 , 7"] into many steps 
of length e and to work out i x + | 2 as a sum over steps rather than as an 
integral. It turns out when this is done that the result is the same as (153). 

The wave function at t = T to order e 2 is therefore given by 



3e 2 

277 


m In (me ) T 



(154) 


for T not too large, where 




= m A- A. 


(155) 


The effect of the response of the universe is to “change” the mass m to m' 
given by (155). The usual theory gives 

3e 2 t K 0 

A — — —min—, (156) 

277 m 


where K {) is a cutoff applied to the K integration, which is carried out after 
integrations with respect to t 3 , t 4 . Although (153) and (156) are similar in 
form, the cutoff in (153) can be given a physical basis, for example, in terms 
of the gravitational radius of the particle, whereas that in (156) is a mathe¬ 
matical variable from a Fourier integral and should not play a role in the 
theory. 

We wish next to consider the corresponding problem for a free particle 
with p l = (— p, A- E), p 7 ^ 0, E > 0, and 4-spinor L ?I. According to (130), 
(131), there are restrictions now on all the coordinates. This leads to a more 
complicated situation. It is possible, however, to simplify the work by noting 
first that if, as before, we restrict only the time-coordinate, by 1*3 *41 ^ £4 ’ 
the outcome for the wave function at point 2 on t — T is 


L ?I exp[//> • x 2 — i(E A- AE)T], 


(157) 
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where 2 


EAE= - (158) 

Ztt 

Since we are integrating invariant functions over a four-dimensional region, 
it is clear that if we had used 

1*3 * - */| > ^ (159) 

to restrict one of the space coordinates, ju = 1, 2, 3, instead of the time 
coordinate, we should have obtained ln(/?^c^) instead of ln(£c 4 ), and that 
if we had used 

1*3* - *4*1 > i= 1,2, 3,4, (160) 

lor all coordinates we should have obtained ln(/? 7 € l ). This is simply ln(me), 
so that in place of (158) we have the invariant result 

E AE — — m 2 In (me) = m Am. (161) 

• v ‘ 277 

Unlike the usual theory the so-called “wave function renormalization,” 
represented by the constant 


5 = - In (nu), 


(162) 


did not appear in the above work. The constant B does appear, however, 
if we seek the wave function at an intermediate time /, 0 < t < T. We again 
consider the case of a particle with wave function corresponding to energy 
m at t = 0. If we were to restrict our time-integrals to the interval (0, t) the 
analysis would be the same as that given above, but with t replacing T, and 
we should obtain t for the wave function at time t. But we must 

permit integrations up to T. This does not change Q v which has 
t 4 > 0. The restriction t 3 < t here arises because 5 + (^ 2 34 ) leaves the particle 
in a positive-energy state after the response is completed at point 3, and 
hence we must go forward in time from t 3 to t. But for Q 2 , in addition to 
t > t 4 > t 3 > 0, we have also to consider 

T> t 4 > t> t 3 > 0. (163) 


2 The proof of (158) is contained in Hoyle and Narlikar (1971). 
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It is this additional time-range that makes a contribution to the wave 
function at time /, over and above % 0 e~ xm t . 

The extra contribution is 




where 


(164) 



77 



T> t 4 > t > t 3 > 0 


77 



T>t 4 >t>t 3 > 0 


e im(t 3 -t 4 > j (2m + K)e lxK(x 3 dK dt 3 dt 4 


pim(t 3 —t 4 ) 




dt 3 dt 4 . (165) 


The intervals for t 3 , t 4 , have been kept open at t in the above formulas 
to indicate that t 4 — t 3 has a minimum value c. We require 

t 3 < t — ae, t 4 > t + /?€, (166) 


where a, /?, are positive numbers satisfying a + ft = 1. All possible choices 
for a, fi, lead to the same final result. Taking a = 0, and making the 
substitutions 


x = m(t 4 - t), y = m(l 4 - t 3 ), (167) 


we obtain 



(168) 


Provided / is not close to zero or to T, the \y 2 in (168) gives a large contri¬ 
bution when both * and y are close to me. After integrating with respect 
to y for this term we get 



Provided t is not close to zero or to T, the required wave function to order 
e 2 is therefore 



(170) 


The appearance of B within the slab is of importance when an external 
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potential acts on the particle. In general an external potential 5 can be 
represented by a four-dimensional Fourier integral 

W) = f (171) 

The scattering problem is to determine the amplitude for a state of momen¬ 
tum p 2 at t 2 = T given that the state has momentum p 1 at / 1 = 0. From 

the path-integral point of view we have to consider the effect of the phase 
factor 



(172) 


on this problem. 

Expanding (172) as a power series, the unity term leads to (154), which 
involves no scattering. The linear term in B t produces scattering, however. 
In most cases of physical interest, higher-order terms in B { give smaller 
contributions to the scattering than the linear term; so it is usual to restrict 
the problem to determining the effect of the linear term. It is found that 
only those components in the Fourier integral for which 

<J=P 2 - Pv <h = E 2 - E v ( 173 ) 

where E 2 = (p 2 2 + m 2 ) 1/2 , E 1 = (p t 2 + m 2 ) 1/2 , make a contribution to the 
scattering into states of momentum p 2 . For finite T the Fourier components 
making effective contribution can depart from the strict equalities of (173) 
by small terms of order T~ l . 

The scattering amplitude can be developed in a series of ascending powers 
of q\ Since b\ q\ appear in relativistically invariant forms, and since 
b(q) • q — 0 because of the Lorentz condition, the b l components can only 
appear in the combination while the q l components can appear as q' 2n + 2 
or as q 2n $, n = 0, 1,2,.... The most general form for the scattering ampli¬ 
tude is therefore seen to be 

OO 00 

fKq) 2 c nq 2n + \M4 - #(<?)] 2 D nf n > < 174 > 

n — 0 ft = 0 


3 The reader will note the clash between our notation B i for the external potential and B 
for the constant — e 2 In (me)/2m. 
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where the C n , D n , are constants. Detailed calculations in quantum electro¬ 
dynamics show that none of C n , n = 1,2,.. ., or D n , n = 0, 1, 2,..., involve 
the cutoff 1 n(K 0 /m). Because the present theory differs from the usual theory 
only in the form of the cutoff, not in the places where it occurs, the present 
theory leads to (174) with the same coefficients—except possibly for C 0 . It 
is well-known that the second series in (174) gives the anomalous magnetic 
moment of the electron, and the entire series without the C 0 term gives the 
Lamb shift. The observable effects of including 




(175) 


in scattering are therefore the same in this theory as they are in the usual 
theory. 

Although we are working only to the first order in it is important to 
notice that (174) includes all orders in the expansion of (175)—in the usual 
language we can have any number of virtual photons. Each order in e 2 is 
handled by Feynman-Dyson graphs. It is found that while individual graphs 
give cutoff-dependent contributions to C 0 , the sum of the contributions of 
all graphs does not involve the cutoff. The well-known situation to order 
e 2 is shown in Figure A3.1. The cutoff-dependent term from I cancels those 
from II and III. 

Once again we expect the same result for C 0 in this theory. Flere we use 
a method depending on the closed form of the electromagnetic phase factor 
in the path integrals, to show that C 0 must be cutoff-independent to all orders 
in e 2 , and hence it follows that the detailed calculations based on Feyn¬ 
man-Dyson graphs must lead to the cancelations described above. 

The C 0 term in (174) survives as the components q l tend to zero. Let the 
field B l be such that b^q) is zero except near q { = 0 for all i: that is to 



FIGURE A3.1 



The three situations that arise when the external potential and the response of the 
universe act once in the perturbation expansion. The infinite 5-term in I is exactly 
canceled by similar terms arising from 11 and III. 
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say, we collapse the Fourier coefficients b l on to the origin in </-space. As 

the region in which b^q) is nonzero shrinks, B t becomes effectively constant, 
and 


exp 



B; dx l 


exp[ — ieB • (x 


*1)]. 


(176) 


r 


21 


The electromagnetic phase factor arising from B i becomes path-independent. 
The gauge transformation 



(177) 


reduces the field to zero. Since a gauge transformation produces no physical 
effect, the situation is the same as for a zero field. There can be no scattering 
and therefore no cutoff-dependent terms in C 0 . 

Consider the case B x — (-B, 0) in further detail, writing/; = — eB. Let 
the four-component spinor at t — 0 again be % 0 corresponding to a free-par- 
ticle state of energy m. Suppose that the wave function on t — 0 has no 
spatial dependence. Then we have to consider % 0 e ip ' x i on t — 0 because 
of the e ip ' x * factor in (176 ). Before propagation to t — T the product % 0 e ip ’ x i 
must be separated into positive-energy and negative-energy components for 
±E — zt \/p 2 4- m 2 . Write 


% = %+ + <&_, (178) 

where exp[/(/? • x qz Et)] L ?l ± are solutions of the free-particle Dirac equation. 
Only L ?l + is propagated forward to t = T. Propagation subject to (175) was 
studied earlier in this section. After multiplying by e~ ip ' x * from (176) the 
wave function at point 2 on t — T is 

qi +e - iiE + AE)T , (179) 

where AE is given by E AE = m Am, AE being defined in (161). If the field 
had been zero, the wave function on t — T would have been 

% 0 e- iim + Am)T . (180) 

The difference between (179) and (180) appears to contradict what was 
said in the previous paragraph. The difference has arisen from the speci¬ 
fication of the wave function at t = 0. Propagation from t < 0 cannot give 
a wave function on t — 0 with spinor % 0 and without an x x dependence 
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FIGURE A3.2 

The simplest vacuum diagram 
leading to an observable result 
( — 27 Mc/sec in the Lamb shift). 


unless B — 0 for t < 0. We have therefore introduced a discontinuity in 

X 

the field at t = 0, a discontinuity that cannot be removed by a gauge trans¬ 
formation. Indeed, the discontinuity generates components in the Fourier 
integral (171) for b i away from the origin in g-space, which is just the sit¬ 
uation we set out to avoid. It is therefore necessary, if we wish to have no 
jc 1 _ dependence on t = 0 to specify L U + as the 4-spinor. Alternatively, if we 
wish to specify c ?/ 0 we must have an x t dependence on t = 0 given by the 
factor e~ ip ' x K In both these cases the constant field has no effect. 4 


§A3.5. Vacuum Polarization 
The amplitude for the process shown in Figure A3.2 is 

-(-ie 2 )(-ie) J J J u 2 (5)y { u 1 (5) 

S + (< 7 2 54 ) Tr[Y%<4; 3)B(3)K + (3; 4)] dr 3 dr 4 dr 5 , (181) 

the integrations with respect to <7t 3 , d t 4 , <7t 5 , being over the slab 0 < t < T. 


’It is well-known that an invariant finite cut-off procedure, such as we suggest here, is not 
consistent with a strict conservation of probabilities. Finite radiation processes involving 
convergent integrals are cutoff at c _1 instead of going to infinity with respect to frequency. 
This leads to a very small change of order trn in the calculated probabilities. The question 
of conservation of probabilities arises in this order. Our point of view is that this question 
cannot be discussed within the framework of Minkowski spacetime. The gravitational effect 
of an electron distorts spacetime to the same order as the question at issue. 

While this paper was in press, one expressing a similar point of view by Salam and Strathdee 
(1970) appeared. They discuss this same issue at greater length and in more detail than we 
have done. 
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The expression (181) is the lowest-order contribution of the vacuum in the 
presence of an external field B i to the scattering of a particle from state 
u x into state u 2 . This amplitude follows without difficulty from the general 
path integral of §A3.3. Since the integral of a constant potential around 
a closed path is zero, it is always possible to subtract an arbitrary constant 
potential from B { . We shall make use of this possibility at a later stage. 
The scattering is the same as that produced by a potential 

A\5) = ie 2 ff S + (<jr 2 54 ) Tr[y%(4; 3)$(3)K + (3; 4)] dr 3 d u . (182) 

This is the potential produced by a currenty i (5) given by 

jX 5) = ^□^‘(5) = ie 2 f Tr[y%(5; 3)$(3)K + Q-5)}dr 3 . (183) 

In this section we show that (183) can be evaluated in such a way that no 
quadratic divergence is present, and we show that the cutoff-dependent 
logarithmic term is of the same form as that encountered in the preceding 
section. Given p we can construct A 1 and hence obtain the scattering from 

—/e f u 2 {5)^(5)u 1 (5) dr 5 . (184) 

With a slight change of notation we write 

j k ( 1) = ie 2 f Tr[A' + (l; 1)($ 2 ~ $x) K +( 2 > 1)yJ<* 2 (185) 

where $■.. ljl v are the values of $ at points 2 and 1. The potential B 1 is 
constant and we are now making use of the possibility of subtracting a 
constant potential from the variable potential fy 2 . This step is useful in 
respect of the Taylor expansion 

4 - h = Y 4 + •• •), (186) 

where the derivatives B i l , B i lm , are evaluated at point 1, and q' — 
x 2 * — Xj*. We write the propagators in the form 

K + ( 2; 1) = (-f 2 + im)I + (q 2 ) = -2rfl + '(q 2 ) + iml + (q 2 ), (187) 

K + ( 1; 2) = 2rfl + '(q 2 ) + iml + (q 2 ), 


( 188 ) 
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where 

W) = -^2 W )> ( 189 > 

and 

I + (q 2 ) = - 47 % 2 ) + gf- #i < 2 W)- 090) 

In the Hankel function ^ is the positive square root of q 2 when q 2 > 0 and 
is the negative square root when q 2 < 0. The limiting form of I + (q 2 ) as 
q 2 —*- 0 is 

W)-»- - 4^W)- 

Inserting (186), (187), (188) in (185) we get 

AC 1 ) = ' e2 J TWV + ' + >mI + )(^B i t q l + ^B ilm q l q m + .. 

•(-2y r q r i+' + imi + )y k ]d*q. C 191 ) 

To simplify (191) we note that the integrations remove terms of odd order 
in q l9 and that the trace removes terms that contain an odd number of y 
matrices. Thus 

AO) = ( ie2 )’ B i,i m f Tr( — 2yPy i y r y k q p q l q m q r I + ' 2 

— y m-/ + - y i y k q l q m + .. ■ Jd 4 q. (192) 

Next we use 

TrCyAfc) = 4 t)\, 

Tr(Y p y' y r y k ) = Mv pi n\ + i p k r ) ir - v pr v'k)< ( 193 ) 

to give 

AC 1 ) = - 8ie2B i,imf ( 2 ^k - qW k )i+ 2 q l q m d*q 

- 2 ie 2 m 2 B i lm f r\ i k I + 2 q , q m d 4 q +- 


(194) 
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Only the first term in (194) is cutoff-dependent. The rest remain finite as 

c 0. Since it is the cutoff-dependent term we are seeking, we shall 
confine further attention to this first term. 

By symmetry we can write 

J I+' 2 q l q m q l q k d 4 q = \(ri ,m -q ik + rj li T) mk + ri lk ri mi ). (195) 

Contracting / against m and i against k gives 

§ {I+Xq 2 )q 2 } 2 d 4 q = 24A. (196) 

The integral on the lefthand side of this equation is worked out in Hoyle 
and Narlikar (1971). Using the results there, we have 

A = - 79^2 ln(w£ )‘ (197) 

Returning to the first term of (194) we note that 

( 2 q l q>c - qWk)i+ 2 q l q m d 4 q 

= 2 ^ B i,i m ( r i lm v , k + + v l kV mi - 3 V m i?Y) 

= -4A UB k + 2 \B\ lk + 2XB™ km 

= -4AD5 k , (198) 

since B m km = B m mk = 0 and B 1 lk = 0. Using (198) we get 

j k (l) ~ 32ie 2 \OB k . (199) 

With (197) for A, 




( 200 ) 


The d’Alembertian of B k is taken at point 1. Hence there is no vacuum 
current except at the sources of the external field B k . In the direct-particle 
theory this means there is no polarization current except at the particles, 
if A is the particle current, we have = 477,4 and 



2e 2 


ln(mt)J k (l). 


377 


( 201 ) 
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The result that the polarization current occurs only at the particles holds 
also for the finite terms not considered explicitly above. 

The usual theory gives 





( 202 ) 


in terms of a momentum cutoff M. In §A3.4 we saw that e in this theory 
replaces the photon cutoff K 0 of the usual theory. Now we see that € also 
replaced the momentum cutoff of the usual theory. There is no requirement 
for two distinct modes of cutoff. 

We also note that the quadratic divergence, encountered in most treat¬ 
ments of the usual theory, did not appear in the above work (see Hoyle 
and Narlikar, 1971). 

APPENDIX 4 (a note to §6.3) 

We are required to show that m* = S\p* = S2~ 3/2 \p, makes the Dirac 
equation conformally invariant. The proof of this important result was 
deferred in the main text because the work of §8.3 is needed for it. Using 
the representation in which 



(203) 


the Dirac equation for unstarred quantities takes the form 



(204) 


with respect to the metric 


ds 2 = g ik dx 1 dx k . 


(205) 


A conformal transformation to 



(206) 


is accompanied by 



(207) 
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We are required to prove that 


satisfy 





i • * 

4 - im 




(208) 


(209) 


It will be sufficient for this purpose to show that 



a: i 


+ im v 


*--*13 



( 210 ) 


since a similar proof applies to g* i a j 3 V* li . i + im*u* a . 

The spinor derivatives in curved space must be interpreted in accordance 
with the discussion of §8.3. Thus 



a; i 



( 211 ) 


where 



( 212 ) 


For the lefthand side of (210) we have 


g* iap u* i + im*v* 13 




d 


- (S “ 3/ X ) 


_dx 


L*V ai S2-^u fJ 


+ iQ 5/2 mv@, (213) 


in which 
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(214) 
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Substituting (214) in (213) gives 


i? 5/2 (g 




a; i + 


g 


ia Pu„.i + imv 13 


a\ % 


4 

3 

2 


g l a\g myX 


K g ia0 Uy ( 


0 In Q 0 In Q 

glm ^.A "1“ girrt 


dx 


dx 


g 


ia %„ 9 ln ? + 


a dx 4 


dx 


gil 


dlntt 

dx m 


(215) 


The terms in 0 ln £2/dx l are easily seen to give 


3_ fxiaPij d tn i2 
2* “ dx 1 ’ 


(216) 


while the remaining terms in ln are 


1 7 y\ iafi 0 lit *12 

4 - g a\gi g Pu y 


+ 4 g ia ig nm g ini3 lty d[nS? 


4 


dx 


m 


(217) 


Using 


g^yXpr iocfi — 2^ x/? 


it is easy to see that (217) becomes 


(218) 


1 rrl . 0 In 1 0 myii tl 0 In *12 _ _3_ criafi 0 In il2 


2 


<*X 


7 dx 1 6 y dx m 


2 S ‘ a dx 


f-, (219) 


which cancels (216). Hence the righthand side of (215) vanishes because 
of (209). This establishes the required result. 


APPENDIX 5 (a note to §8.2) 

In Appendix 3, the contribution of the electromagnetic interaction 

— ie j B i dx\ (220) 

operating in a spacetime slab t 1 < t < t 2 , to the wave function at point x 2 . 
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t 2 , was found to be 


-ie f K + (2; 3)$(3)^ + (3) dr 3 , (221) 

the spacetime integration with respect to dr 3 being taken through the slab 
in question. In §8.2 we are concerned with the corresponding problem when 
the elementary interaction —ieB i dx i is replaced by \y i dx i . This gives 

\ J K + (2; 3)(y iy ^^3) dr 3 = f K + (2; 3)* + (3) dr 3 (222) 

by the same argument as that which led to (221). 

The mass interaction, unlike the electromagnetic interaction, keeps the 
same free-particle state xp at all t, so that the total interaction for §8.2 is 

[ \p*(2)K + (2; 3)^ + (3) dr 3 d 3 x 2 

= J ^(2)y 4 A' + (2; 3)\p + (3) dr 3 d 3 x 2 
= j M3) + ^ + (3) dr 3 = f xp(3)\p + (3) dr 3 , (223) 

xp + being the positive-energy part of ip. 

In §8.2 the wave function was written as \p, it being implicit there that 
the particles under discussion had positive-energy states. Hence (223) in the 
notation of §8.2 would have \p + replaced by \p, 

j xp(3)ip(3) dr 3 , (224) 


which was the quoted result. 
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we are required to prove that the choice 
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leads to 



(226) 

(227) 


(228) 


The simplest procedure is to take advantage of the covariance of g l a p.j< 
It follows from this that, if for a general point X we can show that g l a p.j = 0 
for a particular choice of coordinates, then the same result holds for all 
coordinate systems. Choose coordinates such that for point X(x j ), 



UKl = 1,2, 3,4, 


and take the local Lorentz frame so that 


(229) 



(230) 


In any small step x j — x j + 8x> we then have 
However, 8g\p is not in general zero; 8g i a n is 
8 as 


8g ik = 0 to the first order, 
given by (92) of Chapter 




1 

2 Shaft 



+ 0 


a 


Ypi ■ _L Q-Qpi 
6 yft ^ u ft 5 


a a ’ 


(231) 


where 8 a y + 0 a y can be any infinitesimal unimodular transformation, i.e., 
satisfying 0 y y = 0. Hence here 

^g\ft — 0a y °'y'ft + Qft X °'a\- (232) 

Since the f m u are zero, we therefore get for the variation x* x j + 8x j , 




^gl^g^a^Xft 
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o' 4 (0 y “ l 

av\ 


\ 0 y/3 


+ 6 p *o\ K ) = 0, (233) 
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where we have used 6 y y = 0, 0 X X = 0, in the last step. This completes the 
proof, since (233) holds for arbitrary 8x j and therefore requires • = 0. 


APPENDIX 7 (a note to §8.4) 


The variations considered in (231) to (233) were with respect to a change 
from the point x j to a neighboring point x j + 8x*. Also in §8.4 we were 
concerned with a variation of geometry, g ik —»- g ik -f- 8g ih c . Notationally, 
we shall now use 8 in relation to the latter variation instead of to a change 
of the spacetime point. A change of geometry has the effect of altering spinor 
derivatives, u a .j u a]j -f 8u a .^ according to 



8U aj u y . 


(234) 


In §8.4 we took 



Vji 8v a .j)V^gd*x 


(235) 


to be zero; here we prove it. 

As a first step we calculate Su^.j at a particular point. Choose coordinates 
so that (229), (230), hold at the point in question, and by an appropriate 

spin transformation arrange for 

# 

do jot P . 

° = 0 for all /, /c, a, ft (236) 

dx k 

at the same point. The variation 8L y a j arising from (95) of Chapter 8, 

Sg iap = \ gk ai3 Sg ik , ( 23? ) 


again at the point in question, is then given by 


= T 


( 0 ' a X CT 


m 


y i sr m „ + a, yX 


«g'«X 


H 1 “I dx , 


(238) 


Omitting vanishing terms 



yx sr 




(239) 
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so that 




1 
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° l a \° myx 


Mglrn , 
dx j dx 


jm 


3 «& 


dx 


Ji 
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) + a i n ° 


d8g_ 

ma\ g i 


lm 


(240) 


The last term in the square brackets can be written in the form 

• 0 

a iy\ a m aigugkm ___ = a l ^ x a m ai g u — (g km Sg ik ) 


dx 1 


3x 


= -o lyX a m aigil ^-j(g ik Sg km ) 


dx 


n lyi n m . Sim 

“ x dx’ 


(241) 


which cancels the d(8g lm )/dx j term arising from (239). We therefore get 


SLy aj = l r a l ai a m y^ dS§im dSgi 


8 


V dx 1 


jl 


dx 


m 


8 


^§jl;m )» 


(242) 


where in the last line of (242 ) we have restored a generally covariant form 
for 8L y aj . Using (234) and (242), the 8u a;j term in (235) becomes 



- 8 gjm-,l) u y u /3^-g dix - 


(243) 


Integrating by parts, setting 8g ik = 0 on the boundary of the region of 

• • 

integration, and remembering that g iafi . k = 0 for all j, k , «, /?, (243) is easily 
shown to be equal to 




)-, m g iai3 (g lyi g m ai 


g myX g l ai)V^gd 4 X. 


(244) 


It is not hard to see that the integrand of (244) is real. The term 


S gjl( U Y U py,mg iafl g lyK g m a\ 


(245) 


is easily seen to be equal to its complex conjugate: all that is needed is to 

take account of the symmetry of 8g ik and to rearrange the dummy indices 

♦ • 

after taking the complex conjugate, remembering that g jafi and g i(ia are 
complex conjugates. For the term 
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(246) 


we note a relation 



(247) 


in which is antisymmetric in j, /. Because is multiplied by the 


symmetric 8g jlf we are left with —^gji(u y u^). rn g myX g jl 8^^, i.e., with 



(248) 


and this also is real. 

Returning to (235), the two terms involving the spinor u are complex 
conjugates subtracted from each other. Since the above discussion has shown 
that the 8u a .j term is real, it therefore follows that the Sup.j term must also 
be real, and that the subtraction gives zero. A similar argument shows that 
the two terms involving the v spinor also cancel. Hence (235) vanishes, as 
we required it to do. 
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